I specialize in building autonomous AI agents, fine-tuning LLMs, and architecting scalable MLOps pipelines. I'm passionate about creating systems that don't just generate text, but reason, reflect, and verify their own outputs.
Autonomous Researcher with Self-Reflection.
- Built with LangGraph to implement a Generate-Critique-Refine loop.
- Reduces LLM hallucinations by fact-checking claims against real-time web results.
- Tech:
Multi-task Fine-Tuning on a Single Backbone.
- Fine-tuned Microsoft Phi-3 using dual LoRA adapters for specialized Code and Docstring generation.
- Dynamic adapter switching at inference time.
- Tech:
Intelligent Document Q&A System.
- Processes PDFs and Websites with structural and table extraction.
- Built-in monitoring dashboard for latency, cost, and retrieval quality.
- Tech:
Latent Diffusion Model from Scratch.
- Distributed training on AWS SageMaker with DeepSpeed.
- Optimized for inference using ONNX and TensorRT.
- Tech:
I write about the engineering behind my projects. Check out my latest blogs:
- π° This AI Agent Does 3 Hours of Research in 8 Seconds
- π° BiLoRA: Dual Adapter Fine-Tuning for Code & Docstrings
- π° I Built a RAG System That Actually Understands Your Documents
- π° Text-To-Image with Diffusion & AWS SageMaker
- π° Text Summarization GPT from Scratch
Languages:
AI/ML:
MLOps:
Cloud/Infra:
