•Architected and developed 15+ backend APIs using Django REST Framework (CBVs) to power a customer service call routing platform that is integrated with Azure Communication Services (ACS) and handles 1,000+ calls daily.
•Deployed periodic background task scheduling using Celery, Celery Beat, and Redis, processing 500+ automated tasks daily, with backend triggers delivered via Ngrok-exposed endpoints.
•Built backend logic to fetch, handle, and route live calls from the ACS grid; implemented queue-based call routing driven by real-time customer intents and flow configuration, reducing customer wait time by 40%.
•Integrated ACS SDK features such as call recording and transcript generation; developed webhooks and WebSocket-based microservices to stream transcripts live to 50+ concurrent users.
•Stored and indexed 10,000+ call transcripts in Pinecone to enable semantic search with 95% accuracy and efficient call-transcript mapping, enhancing post-call insights and retrieval for customer service teams.
Django REST Framework, Azure Communication Services (ACS), Pinecone, Celery, Redis, MongoDB, CosmosDB, Docker, WebSockets, Ngrok
•Led the development of a Legal-AI RAG system, using Weaviate as the vector database with multi-tenant support. Added real-time web search with the Tavily and EXA APIs to fetch accurate information. Built an AWS S3 integration to handle user document uploads and storage efficiently. Created a pipeline using Unstructured.io to parse different types of legal documents and split them into useful sections.
•Architected multi-agent workflows using LangGraph, including dynamic entity extraction and schema mapping (won an internal hackathon with a $200 prize)
•Developed secure user access with Supabase Auth integrated into a Next.js frontend
•Migrated an AI system from Gradio to FastAPI and Next.js, supporting both local and API-based LLMs (OpenAI, Azure, Ollama, Groq), with a hybrid RAG pipeline that combines vector and full-text search with re-ranking for better results. Added multi-modal QA on documents containing text, tables, and images, plus an in-browser PDF viewer with citation highlights, relevance scores, and low-confidence warnings. Supported complex questions using question decomposition and agent-based reasoning (ReAct, ReWOO). Made core retrieval and generation settings configurable.
•Delivered a SQL-aware chatbot for live data analytics with average query response times under 2 seconds, using defog-sqlcoder and Groq-LLM
•Enabled contextual schema reasoning and real-time interaction over WebSocket, backed by a secure Supabase data layer
•Followed a modular architecture with pluggable LLM and SQL generation modules
•Ensured maintainability by documenting schema-driven query flows and encapsulating prompt logic