Stephen Bonifacio
Manila, Philippines
- AI Product Engineer / AI Architect building production LLM systems at enterprise scale
- Shipped multi-tenant AI platforms serving 4,000+ users across a large conglomerate
- End-to-end ownership: LLM systems, RAG, tooling, frontend, infra, and observability
- 10+ years designing and delivering mission-critical enterprise software
projects
Full-Stack Enterprise AI Agent
Designed and shipped a production-grade, multi-tenant LLM platform that augmented HR support workflows across multiple companies
- Modular, bring-your-own architecture (LLMs, vector DBs, tools, prompts)
- Async FastAPI backend optimized for concurrent RAG + tool execution
- Multi-tenant Next.js app with real-time streaming, branding, and isolation
- Enterprise-grade security: Entra ID SSO, 2FA, JWTs, encrypted PII storage
- Integrated ServiceNow (RAG + tickets) and SAP HCM (employee data access)
- Full observability with Langfuse (latency, tokens, evals, prompt versions)
Python • TypeScript • FastAPI • Next.js • Azure AI Search • Azure CosmosDB • Redis • Langfuse
Microsoft Teams Integration
Extended the core enterprise LLM platform into Microsoft Teams, enabling employees to access the same AI capabilities directly within their primary collaboration tool
- Implemented a Microsoft Teams bot as an additional frontend channel for the existing LLM platform
- Reused shared backend services, RAG pipelines, tools, and security model
- Built the bot using Microsoft Bot Framework and Express.js
- Integrated with Teams messaging APIs for real-time, bidirectional communication
- Implemented adaptive cards for structured, interactive responses in chat
- Deployed and managed via Azure Bot Service alongside the core platform
TypeScript • Express.js • Agent Framework • Azure Bot Service
Observability Platform for Admins
Built an internal observability and control plane enabling non-technical admins to operate, inspect, and improve production LLM systems
- Created document management interface enabling non-technical users to manage RAG knowledge base
- Built document ingestion pipeline for ServiceNow KM articles with automatic chunking and embedding
- Built conversation explorer with message logs, knowledge gap detection, and RAG pipeline inspection for admin oversight
- Implemented usage analytics dashboard tracking token consumption, tool usage, and peak engagement patterns
- Added ticket integration tracking and dark mode support
TypeScript • Next.js • shadcn/ui • Azure AI Search • Azure CosmosDB • Redis
Usage Analytics & Reporting Engine
Click to view
Designed an automated analytics and reporting engine providing executives and stakeholders with ongoing visibility into AI platform usage and value
- Built a scheduled reporting system delivering monthly AI usage reports via email
- Aggregated and deduplicated conversation data across all tenants and applications
- Generated insights covering cost trends, usage patterns, knowledge gaps, and top users
- Implemented fault-tolerant pipelines with automated error handling and retries
- Enabled month-over-month trend analysis to guide platform investment decisions
Python • Azure Functions • CosmosDB
DevOps & Infrastructure
Built and operated the CI/CD and cloud infrastructure underpinning all production AI applications
- Designed CI/CD pipelines in Azure DevOps for the backend API, web app, Teams bot, and admin app
- Automated build, test, and deployment workflows across dev, staging, and production
- Deployed services to Azure App Service with environment isolation and secrets management
- Standardized release processes to support frequent, low-risk deployments
- Supported rapid iteration while maintaining enterprise reliability requirements
Azure DevOps • Azure Pipelines • Azure App Service • Git • Bash
experience
AI Architect
JG Summit Holdings Inc — Manila, Philippines
2023 – Present
Led the design and delivery of a production LLM platform serving 4,000+ users across multiple companies in a large enterprise group.
- Architected a modular, multi-tenant LLM platform used across business units
- Integrated ServiceNow (Knowledge + Incident) and SAP HCM into AI workflows
- Owned web app, Teams bot, admin tooling, and observability stack
- Reduced HR response times from days to seconds via AI automation
- Balanced latency, cost, and answer quality in multi-hop RAG pipelines
Python • FastAPI • Next.js • TypeScript • Azure AI Foundry • Azure AI Search • Cosmos DB • Langfuse
SAP HCM Solutions Lead & Senior Consultant
JG Summit Holdings, Fujitsu Philippines, NGA HR, Accenture — Philippines
2006 – 2023
Enterprise systems leadership background that directly informed large-scale AI platform design
Led enterprise HR software implementations serving 10,000+ users across 6 of the Philippines' top 30 companies.
- End-to-end HR systems architecture (Payroll, Time Management, LMS, ESS/MSS)
- Executive-level requirements gathering and stakeholder management
- Full-cycle implementations for China Banking (7,000 users), PLDT/Smart (7,000 users)
- Production support with >90% SLA compliance
Career pivoted to AI Product Engineering in 2023 to apply enterprise software expertise to LLM systems.
open source
Autonomous HR Chatbot
434 ⭐ on GitHub
Autonomous HR agent using tools and RAG. Integrates Pinecone, CSVs, Azure Data Lake, and SAP HCM pipelines. Early exploration of agentic workflows (2023) that informed later enterprise production systems.
Python • LangChain • Pinecone • Streamlit • OpenAI API • Azure OpenAI
Model Context Protocol Demo with SSE
Early MCP adoption with SSE and Streamable HTTP. Remote tool integration patterns for Zapier and Gmail.
Python • MCP • FastMCP • SSE • asyncio
AI Assistant for Microsoft 365
LLM assistant integrated with Microsoft 365 (Outlook, Teams, Calendar) to automate tasks and manage communications.
Python • OpenAI Assistants API • Microsoft Graph • Streamlit
Autonomous Mall Assistant
LLM-powered location assistant with fallback recommendations.
Python • LangChain • GPT-4 • pandas • Streamlit
selected writings
Author
Towards AI (1M+ followers)
- Context Engineering Is All You need (2025)
- Langfuse — 6 features that can help supercharge your LLM-powered applications (2024)
- Creating a (mostly) Autonomous HR Assistant with ChatGPT and LangChain's Agents and Tools (2023)Boosted (manually selected by Medium's editors for special promotion)
- Deploy Your Company's Own Secure and Private ChatGPT with Azure OpenAI (2023)
- Chat with Company Documents Using Azure OpenAI (2023)
education
BA, University of the Philippines Diliman, (QS Rank #1, Philippines)
Valedictorian, Bagamanoc Rural Development High School
certifications
- Microsoft Certified: Azure Data Engineer Associate
- SAP Certified Application Associate – HCM (ERP 6.0 EHP4)
- ITIL 2011 Foundation
