RAG & LLM Benchmarking - English and Nepali
Benchmarked local sub-10B LLMs and chunking strategies for RAG pipelines across answer quality, latency, and reliability.
- Hit rate@k
- Evidence recall@k
- Latency
- Faithfulness
/ Projects
Benchmarked local sub-10B LLMs and chunking strategies for RAG pipelines across answer quality, latency, and reliability.
Production-ready text-to-SQL system that converts natural-language questions into validated SQL and business insights.
AI assistant for automating B2B sales communication. Generates personalized outreach, summarizes leads, and prepares follow-ups using CRM context and company data.
Generative AI system for creating short-form videos from text prompts by combining gameplay footage, generated visuals, and AI voiceovers.
Vision-language system for captioning and QA on Nepali cultural videos. Multimodal transformers align visual features with Nepali language semantics.
Predictive system for estimating market volatility and Value-at-Risk using historical S&P 500 data with hybrid GARCH-LSTM architectures.
System for recognizing sign language gestures from video using graph neural networks and temporal sequence modeling.