Skip to content
Prakhar Kumar Singh

Prakhar Kumar Singh

GenAI Engineer

2.5+ yrs building production LLM systems, RAG pipelines, and agentic AI at scale. Multi-tenant AI chatbot serving 50+ enterprises on AWS Bedrock & Claude. Python, FastAPI, LangChain, LangGraph, MCP, NL-to-SQL.

Serving Notice - Can Join by July 2026

Open to GenAI / ML / LLM / Agentic AI Engineer Roles

I'm an engineer who builds production AI systems that actually ship to real users. Not prototypes. Not demos. Systems running 24/7 across 50+ enterprise tenants, handling 250+ concurrent users, with 99.9% uptime.

At Aurigo Software, I've designed and built the GenAI stack from scratch - a multi-tenant chatbot platform with hybrid RAG retrieval, an agent builder with visual workflow orchestration, an NL-to-SQL engine with circuit-breaker LLM failover, and deep learning models forecasting budgets across 5,000+ infrastructure projects.

My focus is on the intersection of LLMs, retrieval systems, and distributed infrastructure. I care about latency budgets, data isolation, and making AI systems that don't fall over at 2 AM.

0+

Concurrent WebSocket Connections

0%+

Production System Uptime

0%+

RAG Retrieval Relevance

0%

Deep Learning Forecast R²

0%

API Cost Reduction via Caching

0+

Enterprise Tenants Served

01.Things I've Built

02.Where I've Worked

Software Engineer - GenAI & Machine Learning

Jun 2024 – Present · 2 yrs

Aurigo Software Technologies

  • Engineered production multi-tenant AI chatbot on AWS Bedrock (Claude Sonnet 4.5) using Python FastAPI and WebSocket streaming - 250+ concurrent connections across 50+ enterprise tenants with 2-layer caching (Valkey + semantic) cutting latency by 60% and API costs by 40%
  • Built hybrid RAG combining pgvector (1024-dim Titan embeddings) + BM25 via Reciprocal Rank Fusion (RRF), indexing 100K+ chunks in Aurora PostgreSQL - 85%+ retrieval relevance with LLM-based reranking
  • Architected SQS-based async ETL ingesting 1000+ docs/day via PyMuPDF and Docling OCR with ThreadPoolExecutor (12 parallel workers) and adaptive memory scaling - 99.5% ingestion success rate
  • Built NL-to-SQL engine with circuit breaker, Sonnet-to-Haiku failover, sqlglot AST validation for SQL injection prevention, and PII filtering - 500+ queries/day at 99.9% uptime
  • Established RAGAS-aligned LLM evaluation framework (MRR, correctness, groundedness, completeness) with golden test datasets from 100K+ chunks and SQS-based regression testing for continuous drift detection
  • Built TensorFlow/Keras DNN for infrastructure budget forecasting across 5000+ projects - 94% R², 40% error reduction, deployed on Lambda at sub-500ms latency with S3 model storage with SSM Parameter Store versioning
  • Orchestrated 40+ table MWAA Airflow ETL from S3 CSVs and Pendo API into Aurora PostgreSQL across 10+ tenants - 85% failure reduction via intelligent CSV chunking and 30 concurrent parallel batch operations
  • Established S3-based data quality monitoring with config-driven transformations - 99% data accuracy across 10+ customer instances in production
Bravo Award (Jan 2025) - Recognized for exemplifying innovation and simplification in production GenAI systems

Software Engineer Intern

Dec 2023 – May 2024 · 6 mos

Aurigo Software Technologies

  • Built PoC multi-document search chatbot using AWS Kendra and Anthropic Claude - validated retrieval quality on construction project documents
  • Prototyped early RAG-based chatbot for construction document and database search, laying the foundation for the production multi-tenant system
  • Developed sentiment analysis pipeline for construction bidding documents using transformer-based models
  • Built early budget prediction prototype in TensorFlow/Keras - later evolved into the production forecasting system (94% R²)
  • Contributed to multi-tenant S3/Pendo → Aurora PostgreSQL data warehouse ETL on AWS MWAA
  • Converted to full-time Software Engineer in June 2024 based on performance

Education

B.Tech in Computer Science and Engineering

2020 – 2024

Birla Institute of Technology, Ranchi

8.66 CGPA

Std XII

Jun 2017 – May 2019

Chinmaya Vidyalaya, Bokaro Steel City

94.4%

03.Tech I Use

Languages

PythonSQLBash

Frameworks

FastAPIWebSocketLangChainLangGraphLlamaIndexTensorFlowKerasscikit-learnPandasPyTorchHuggingFace

Cloud

AWS BedrockAWS LambdaS3SQSMWAAAuroraECREKSSSMSageMakerDockerKubernetes

Databases

PostgreSQLpgvectorDynamoDBValkeySQL ServerFAISSBM25

ML/AI

Claude Sonnet 4.5GPT-4Titan EmbeddingsRAGMCPBedrock ToolsRAGASNL-to-SQLsqlglotAgentic AIPrompt EngineeringRRF / Hybrid SearchLLMOps

DevOps

Apache AirflowStructlogCI/CDPyMuPDFDocling OCR

04.Get In Touch

I'm currently looking for my next opportunity starting July 2026. Whether you have a question, a role, or just want to say hi - my inbox is open.

prakharrathore111@gmail.com+91-9939792643Bengaluru, India