/ Projects

Things I've built.

RAG & LLM Benchmarking - English and Nepali

Benchmarked local sub-10B LLMs and chunking strategies for RAG pipelines across answer quality, latency, and reliability.

2026

Workflow:

Hit rate@k
Evidence recall@k
Latency
Faithfulness

AI for Business Insights - Text-to-SQL Analytics System

Production-ready text-to-SQL system that converts natural-language questions into validated SQL and business insights.

2025

Workflow:

Structured business data stored in PostgreSQL
Query understanding to identify relevant tables and fields
Top-k table retrieval with fuzzy matching over column values
SQL generation and execution for analytical datasets
Python sandbox execution for chart generation

AI Sales Copilot for B2B Workflows

AI assistant for automating B2B sales communication. Generates personalized outreach, summarizes leads, and prepares follow-ups using CRM context and company data.

2025

Workflow:

Lead and CRM data ingestion from internal sources
Embedding generation and storage in Qdrant
Semantic retrieval of company context and past interactions
RAG pipeline for personalized email generation
Conversation summarization for lead tracking

Text-to-Reels AI Content Generator

Generative AI system for creating short-form videos from text prompts by combining gameplay footage, generated visuals, and AI voiceovers.

2025

Workflow:

Gameplay video template selection for the lower screen region
LLM-generated narration and content plan from user text
Image generation for upper-screen visuals
ElevenLabs voiceover generation
Automated video assembly into final reel output

Nepali Cultural Video Understanding

Vision-language system for captioning and QA on Nepali cultural videos. Multimodal transformers align visual features with Nepali language semantics.

2024

Workflow:

Scraped Nepali video data from YouTube
Used capable VLMs to generate captions + QA
Frame processing with 10 second clips
VLM and encoder-decoder architecture
Generated final captions and QA outputs

Stock Market Volatility Forecasting

Predictive system for estimating market volatility and Value-at-Risk using historical S&P 500 data with hybrid GARCH-LSTM architectures.

2024

Workflow:

Collected and preprocessed historical stock data from Yahoo Finance
Modeled volatility using GARCH and related variants
Integrated hybrid GARCH-LSTM models for short-term forecasting
Backtested predictions for risk estimation accuracy
Produced Value-at-Risk estimates and decision-support visualizations

Sign Language Recognition with Graph Neural Networks

System for recognizing sign language gestures from video using graph neural networks and temporal sequence modeling.

2022

Workflow:

Extracted hand and body keypoints from video frames
Constructed spatio-temporal graphs for joints and gestures
Applied Graph Attention Networks for spatial dependencies
Integrated LSTM layers for temporal sequence modeling
Predicted sign language gestures on benchmark datasets