About Me
I build AI systems that ship — not just experiments in notebooks. I design and deploy production-grade AI applications end-to-end, from model architecture and fine-tuning to deployed APIs and live demos.
I've built fine-tuned LLMs, multi-agent research pipelines, RAG chatbots, MLOps workflows with drift monitoring, and computer vision systems for real-world safety applications — all served through FastAPI and integrated into live deployments.
I specialise at the intersection of:
NLP & Generative AI
LLMs, RAG systems, LangGraph, LangChain, prompt engineering, Groq & Ollama
LLM Engineering
QLoRA / PEFT fine-tuning, bitsandbytes quantization, HuggingFace Hub deployment
Agentic AI
Multi-agent systems via CrewAI, LangGraph ReAct agents, Tavily tool-use
MLOps & CV
MLflow, W&B, Evidently AI, drift monitoring, CNNs, YOLOv8, FastAPI deployment
Final year B.E. Computer Science (AI & ML) at Lords Institute of Engineering & Technology, Hyderabad — graduating 2026. I stay current with emerging research in LLMs, agentic systems, and MLOps — and I build things to understand them, not just read about them.
Open to AI Engineer, ML Engineer, and MLOps Engineer roles where I can work on production systems that solve real problems.
Technical Skills
My core stack — tools I use to build and ship real AI systems
NLP & Generative AI
Computer Vision & DL
ML Engineering
MLOps & Experiment Tracking
Programming
Tools & Deployment
Featured Projects
AI systems built end-to-end and deployed to production
AI Orchestrator — Smart Multi-Provider Router
Smart AI router that auto-selects the optimal model across Groq, Gemini, OpenRouter, and Ollama. Classifies tasks across 10 categories and routes to the best model based on speed, quality, cost, and privacy — with automatic fallback, benchmark mode, and a full CLI.
17+ models across 4 providers. Benchmark mode fires the same prompt at all models simultaneously with live result cards, latency bars, and winner badges (⚡ Fastest · 💰 Cheapest · 📝 Most Detailed). Private mode routes sensitive data to local Ollama only — never leaves your machine. Reasoning chains from Qwen3 <think> blocks extracted and shown as collapsible toggles.
Multi-Agent Research System
Autonomous multi-agent pipeline where specialised agents — researcher, analyst, writer — coordinate via CrewAI and Tavily to perform live web research, synthesise findings, and generate structured reports with zero human intervention.
Agents are assigned distinct roles and collaborate through a shared task planning loop. Each agent uses tool-use to fetch live data, reason over it, and pass structured output to the next stage — resulting in a fully autonomous research-to-report pipeline.
LLM Fine-tuning — Mistral-7B Medical QA
QLoRA fine-tuning of Mistral-7B-Instruct-v0.2 on a medical Q&A dataset using PEFT. Trained on Kaggle T4 GPU with full environment compatibility handling. Fine-tuned adapter and live Gradio demo deployed to HuggingFace Hub.
Resolved real-world environment compatibility challenges across bitsandbytes, tokenizers, and transformers versioning. Training tracked with Weights & Biases. The fine-tuned adapter and base model are published to HuggingFace Hub with a live Gradio inference demo.
Multi-Document RAG Chatbot
Production-grade RAG chatbot using a LangGraph ReAct agent with hybrid retrieval — ChromaDB vector search, BM25, and FlashRank re-ranking. Streaming SSE responses via FastAPI async backend with dual Streamlit and Gradio frontends.
Solved key production challenges including async event loop conflicts in FastAPI, streaming SSE via LangGraph ReAct, and ChromaDB/BM25/FlashRank hybrid retrieval tuning. Powered by Groq's llama-3.1-8b-instant with HuggingFace embeddings.
MLOps Pipeline with Drift Monitoring
End-to-end MLOps pipeline integrating MLflow for experiment tracking and model registry, Weights & Biases for training visualisation, and Evidently AI for automatic data drift and model performance monitoring.
Structured as a reproducible, production-mimicking workflow with modular stages for data ingestion, training, evaluation, and monitoring. Evidently AI generates drift reports automatically on new data batches, with all runs logged and versioned in MLflow.
PulmoScan AI — Tuberculosis Detection
Deep-learning diagnostic system for automated TB detection from chest X-rays, featuring real-time Grad-CAM heatmap visualisations and PDF report generation. Deployed on HuggingFace Spaces.
End-to-end diagnostic pipeline built for early TB screening. Custom-trained TBNet CNN architecture analyses chest X-rays, highlights infected regions via Grad-CAM heatmaps, and generates professional PDF diagnostic reports. FastAPI backend with optimised inference pipeline and an intuitive web interface.
Doubt Tutor — AI Study Assistant
Interactive AI-powered study assistant supporting multiple models with PDF, image, and document uploads — offering real-time doubt-solving with contextual memory and multi-modal reasoning.
AI tutor capable of analysing images, PDFs, and text files using Llama 3, Qwen-VL (vision), and Groq models. Includes chat history export, file previews, secure secret handling, and custom UI components.
PTEra — AI-Powered PTE Mock Assessment
Full-scale PTE mock test platform with adaptive difficulty, real-time scoring, section timers, and professional exam-style UI — powered by Groq and Gemini models. Deployed on HuggingFace Spaces.
PTE Academic mock system with Aptitude, Listening, and Reading modules. Features automated question generation via Groq & Gemini, strict time tracking, dynamic content validation, analytics-driven scoring, and a premium custom UI. Secure API handling and modular code architecture.
TruthLens — Fake News Detector
Dual-AI system combining CNN-LSTM deep learning with Gemini API to detect fake news with 94.2% accuracy across 40,000+ articles. Features an analytics dashboard and REST prediction API.
CNN-LSTM Hybrid
Custom NLP architecture
Analytics Dashboard
Chart.js real-time stats
REST API
POST /api/predict
Admin Auth
Session-based login
SQLite + ORM
Persistent prediction logs
94.2% Accuracy
40,000+ article dataset
Trained on 40,000+ news articles. Features real-time predictions, analytics dashboard, prediction history, and dual-model verification using both a custom neural network and Gemini AI.
Building Safety Smoke Detection
Dual-module fire and smoke detection system combining 7 ML classifiers on 62,630 IoT sensor readings (AUC-ROC > 0.999) with MobileNetV2 CNN (96.98% accuracy) and YOLOv8 real-time detection. B.E. Major Project.
Trains 7 classifiers on real IoT sensor data. MobileNetV2 fine-tuned via transfer learning achieves 96.98% validation accuracy. YOLOv8 draws real-time bounding boxes on uploaded images. Full-stack Django app with role-based user management, live ML training, sensor prediction, and CNN+YOLO inference — deployed on Railway.
Experience
Building and shipping production AI systems
Artificial Intelligence Intern
TechZone Software Academy for Training & Research
Hyderabad, India
Key Achievements
- Architected and deployed 4 production-grade AI systems for a live ed-tech platform serving students and educators across Hyderabad
- Engineered a RAG-powered AI tutoring system using LangChain and vector databases, delivering contextual real-time academic support
- Designed and shipped an automated PTE mock-test engine with LLM-based scoring across reading, writing, and listening modules
- Built an MCQ generation and attendance automation system, eliminating manual academic tracking across departments
- Delivered all systems as production-ready FastAPI services with REST-based orchestration and continuous prompt optimization
Technologies Used
AI/ML Intern
RAM Innovative Infotech
Hyderabad, India
Key Achievements
- Completed an intensive AI/ML training program organised in collaboration with Lords Institute CSE-AIML department
- Developed a disease prediction ML model using Python and Scikit-learn, applying supervised learning classification on medical datasets
- Gained hands-on exposure to Django for web-based AI integration and data preprocessing pipeline design
Technologies Used
Currently seeking full-time AI/ML Engineer roles — open to remote and Hyderabad-based opportunities.
Certifications
Verified credentials from leading AI and ML institutions
Machine Learning Specialization
DeepLearning.AI / Coursera
by Andrew Ng
Generative AI with Large Language Models
DeepLearning.AI / Coursera
by DeepLearning.AI
Python for Everybody Specialization
University of Michigan / Coursera
by Dr. Charles Severance
Google AI Essentials
Google / Coursera
by Google Career Certificates
AI/ML with Generative AI — Internship
TechZone Software Academy
by TechZone
AI/ML with Generative AI — Course
TechZone Software Academy
by TechZone
Get In Touch
Open to AI Engineer and ML Engineer roles. Reach out for collaborations, internship opportunities, or project discussions.
Send a Message
📍Based in Hyderabad, India