/ Projects

Things I've built.

01

RAG & LLM Benchmarking - English and Nepali

Benchmarked local sub-10B LLMs and chunking strategies for RAG pipelines across answer quality, latency, and reliability.

2026
Workflow:
  • Hit rate@k
  • Evidence recall@k
  • Latency
  • Faithfulness
02

AI for Business Insights - Text-to-SQL Analytics System

Production-ready text-to-SQL system that converts natural-language questions into validated SQL and business insights.

2025
Workflow:
  • Structured business data stored in PostgreSQL
  • Query understanding to identify relevant tables and fields
  • Top-k table retrieval with fuzzy matching over column values
  • SQL generation and execution for analytical datasets
  • Python sandbox execution for chart generation
03

AI Sales Copilot for B2B Workflows

AI assistant for automating B2B sales communication. Generates personalized outreach, summarizes leads, and prepares follow-ups using CRM context and company data.

2025
Workflow:
  • Lead and CRM data ingestion from internal sources
  • Embedding generation and storage in Qdrant
  • Semantic retrieval of company context and past interactions
  • RAG pipeline for personalized email generation
  • Conversation summarization for lead tracking
04

Text-to-Reels AI Content Generator

Generative AI system for creating short-form videos from text prompts by combining gameplay footage, generated visuals, and AI voiceovers.

2025
Workflow:
  • Gameplay video template selection for the lower screen region
  • LLM-generated narration and content plan from user text
  • Image generation for upper-screen visuals
  • ElevenLabs voiceover generation
  • Automated video assembly into final reel output
05

Nepali Cultural Video Understanding

Vision-language system for captioning and QA on Nepali cultural videos. Multimodal transformers align visual features with Nepali language semantics.

2024
Workflow:
  • Scraped Nepali video data from YouTube
  • Used capable VLMs to generate captions + QA
  • Frame processing with 10 second clips
  • VLM and encoder-decoder architecture
  • Generated final captions and QA outputs
06

Stock Market Volatility Forecasting

Predictive system for estimating market volatility and Value-at-Risk using historical S&P 500 data with hybrid GARCH-LSTM architectures.

2024
Workflow:
  • Collected and preprocessed historical stock data from Yahoo Finance
  • Modeled volatility using GARCH and related variants
  • Integrated hybrid GARCH-LSTM models for short-term forecasting
  • Backtested predictions for risk estimation accuracy
  • Produced Value-at-Risk estimates and decision-support visualizations
07

Sign Language Recognition with Graph Neural Networks

System for recognizing sign language gestures from video using graph neural networks and temporal sequence modeling.

2022
Workflow:
  • Extracted hand and body keypoints from video frames
  • Constructed spatio-temporal graphs for joints and gestures
  • Applied Graph Attention Networks for spatial dependencies
  • Integrated LSTM layers for temporal sequence modeling
  • Predicted sign language gestures on benchmark datasets