I'm an engineer who builds production AI systems that actually ship to real users. Not prototypes. Not demos. Systems running 24/7 across 50+ enterprise tenants, handling 250+ concurrent users, with 99.9% uptime.
At Aurigo Software, I've designed and built the GenAI stack from scratch - a multi-tenant chatbot platform with hybrid RAG retrieval, an agent builder with visual workflow orchestration, an NL-to-SQL engine with circuit-breaker LLM failover, and deep learning models forecasting budgets across 5,000+ infrastructure projects.
My focus is on the intersection of LLMs, retrieval systems, and distributed infrastructure. I care about latency budgets, data isolation, and making AI systems that don't fall over at 2 AM.
Concurrent WebSocket Connections
Production System Uptime
RAG Retrieval Relevance
Deep Learning Forecast R²
API Cost Reduction via Caching
Enterprise Tenants Served
01.Things I've Built
02.Where I've Worked
Software Engineer - GenAI & Machine Learning
Jun 2024 – Present · 2 yrsAurigo Software Technologies
- Engineered production multi-tenant AI chatbot on AWS Bedrock (Claude Sonnet 4.5) using Python FastAPI and WebSocket streaming - 250+ concurrent connections across 50+ enterprise tenants with 2-layer caching (Valkey + semantic) cutting latency by 60% and API costs by 40%
- Built hybrid RAG combining pgvector (1024-dim Titan embeddings) + BM25 via Reciprocal Rank Fusion (RRF), indexing 100K+ chunks in Aurora PostgreSQL - 85%+ retrieval relevance with LLM-based reranking
- Architected SQS-based async ETL ingesting 1000+ docs/day via PyMuPDF and Docling OCR with ThreadPoolExecutor (12 parallel workers) and adaptive memory scaling - 99.5% ingestion success rate
- Built NL-to-SQL engine with circuit breaker, Sonnet-to-Haiku failover, sqlglot AST validation for SQL injection prevention, and PII filtering - 500+ queries/day at 99.9% uptime
- Established RAGAS-aligned LLM evaluation framework (MRR, correctness, groundedness, completeness) with golden test datasets from 100K+ chunks and SQS-based regression testing for continuous drift detection
- Built TensorFlow/Keras DNN for infrastructure budget forecasting across 5000+ projects - 94% R², 40% error reduction, deployed on Lambda at sub-500ms latency with S3 model storage with SSM Parameter Store versioning
- Orchestrated 40+ table MWAA Airflow ETL from S3 CSVs and Pendo API into Aurora PostgreSQL across 10+ tenants - 85% failure reduction via intelligent CSV chunking and 30 concurrent parallel batch operations
- Established S3-based data quality monitoring with config-driven transformations - 99% data accuracy across 10+ customer instances in production
Software Engineer Intern
Dec 2023 – May 2024 · 6 mosAurigo Software Technologies
- Built PoC multi-document search chatbot using AWS Kendra and Anthropic Claude - validated retrieval quality on construction project documents
- Prototyped early RAG-based chatbot for construction document and database search, laying the foundation for the production multi-tenant system
- Developed sentiment analysis pipeline for construction bidding documents using transformer-based models
- Built early budget prediction prototype in TensorFlow/Keras - later evolved into the production forecasting system (94% R²)
- Contributed to multi-tenant S3/Pendo → Aurora PostgreSQL data warehouse ETL on AWS MWAA
- Converted to full-time Software Engineer in June 2024 based on performance
Education
B.Tech in Computer Science and Engineering
2020 – 2024Birla Institute of Technology, Ranchi
8.66 CGPAStd XII
Jun 2017 – May 2019Chinmaya Vidyalaya, Bokaro Steel City
94.4%03.Tech I Use
Languages
Frameworks
Cloud
Databases
ML/AI
DevOps
04.Get In Touch
I'm currently looking for my next opportunity starting July 2026. Whether you have a question, a role, or just want to say hi - my inbox is open.


