I'm a passionate developer and B.Tech. graduate from IIT Madras, specializing in AI Prompt Engineering, Machine Learning, Natural Language Processing, and Full-Stack Web Development. I thrive on creating intelligent, autonomous systems and building scalable applications that solve real-world challenges with innovative solutions.
- π Education: B.Tech from IIT Madras
- πΌ Current Role: GenAI Prompt Engineer at Klarity.ai - Building powerful GenAI automations at unprecedented speed and scale
- π Competition Track Record: 3rd place (ML for Marine Autonomy) | 4th place (Pravartak Datathon)
- π Currently Working On: Agentic AI systems, RAG architectures, and voice-first applications
- π± Currently Learning: Advanced prompt engineering techniques, multi-agent systems, and LLM orchestration
- π‘ Expertise: Prompt Engineering, Context-Aware Generation (CAG), LLM fine-tuning, and AI workflow automation
π§ RepoMind
An open-source, AI-powered application using Agentic CAG to chat with any public GitHub repository or developer profile, offering deep code analysis, visual architecture maps, and security audits.
π― Key Highlights:
- Context-Aware Engine (CAG): Intelligently selects relevant code snippets instead of loading entire repositories
- Visual Architecture Maps: Auto-generates Mermaid flowcharts from complex code logic for instant visualization
- Deep Profile Intelligence: Analyzes coding styles, commit patterns, and developer habits across entire portfolios
- Zero-Config Security Audits: Detects vulnerabilities with AI-powered triage and actionable fix recommendations
- Mobile-First Design: The only tool in its class optimized for on-the-go code reviews
π‘ Why It Stands Out:
- β Works instantly on any public repoβno installation, login, or GitHub App required
- β Uses Context Augmented Generation (CAG) vs traditional RAG for superior code understanding
- β Generates interactive visuals instead of text walls
- β Analyzes developers, not just code
π οΈ Tech Stack: Next.js β’ Google Gemini β’ Vercel KV β’ TypeScript β’ Tailwind CSS β’ Framer Motion
π£οΈ EchoTasks
A voice-first to-do list application for intuitive task management entirely through natural language commands, powered by Deepgram for real-time transcription and Groq's qwen-2.5-32b for AI command analysis.
π― Key Highlights:
- Voice-First Interface: Create, update, and manage tasks using natural languageβno typing required
- Real-Time Transcription: Leverages Deepgram's blazing-fast speech-to-text for instant feedback
- AI-Powered Command Analysis: Understands complex intents like "remind me to buy groceries tomorrow at 5pm"
- Client-Side Intelligence: Local models for instant priority detection and date parsing
- Seamless UX: Combines Groq's speed with intuitive animations for a smooth experience
π Impact:
- β‘ 95%+ transcription accuracy with sub-second latency
- π― Handles 20+ natural language command variations
- π± Fully responsive with touch and voice support
π οΈ Tech Stack: Next.js β’ React β’ Deepgram β’ Groq (qwen-2.5-32b) β’ ShadCN/UI β’ Tailwind CSS β’ Framer Motion
π£οΈ RoadSafetyAI
An advanced web application providing expert-level road safety intervention recommendations using Retrieval-Augmented Generation (RAG) architecture with Google's Gemini LLM and Vertex AI Search.
π― Key Highlights:
- RAG Architecture: Grounded recommendations from a curated knowledge base of road safety research
- AI Orchestration: Leverages Google Genkit to manage the entire AI workflow seamlessly
- Evidence-Based Advice: Provides detailed rationale with direct citations from authoritative source documents
- Optimized Query Flow: Brainstorms multiple search queries and executes parallel searches for reduced latency
- Domain Expertise: Built on real-world road safety data and best practices
π Impact:
- π Processes 100+ authoritative safety documents
- β‘ 60% faster response time through parallel search optimization
- π― Provides source-backed recommendations with verifiable citations
π οΈ Tech Stack: Next.js β’ React β’ TypeScript β’ Tailwind CSS β’ ShadCN UI β’ Google Genkit β’ Gemini 2.5 Flash β’ Vertex AI Search β’ Firebase App Hosting
β¨ IdeaFlowAI
An innovative web application that streamlines the process of transforming raw ideas into structured, developer-ready plans using generative AI.
π― Key Highlights:
- AI-Powered Idea Extraction: Converts concepts from text, images, or PDFs into structured summaries
- Adaptive Questionnaire: AI product manager refines ideas with non-technical, multiple-choice questions
- Intelligent Tech Stack Suggestions: Recommends optimal tech stacks based on project requirements
- Developer-Ready Brief: Generates complete project setup prompts, file structures, and sequential engineering prompts
- End-to-End Workflow: Takes you from "I have an idea" to "Here's the implementation plan"
π Impact:
- β±οΈ Reduces project planning time by 70%
- π― Supports 10+ input formats (text, images, PDFs)
- π§ Generates actionable developer prompts ready for AI coding tools
π οΈ Tech Stack: Next.js β’ TypeScript β’ Google Genkit β’ Gemini AI β’ React β’ ShadCN UI β’ Tailwind CSS β’ Firebase (Firestore, Auth)
π₯ CancerCareAI
An AI-powered system for extracting cancer-related information from patient Electronic Health Record (EHR) notes, focusing on information retrieval and structured medical data extraction.
π― Key Highlights:
- Multi-Stage Information Retrieval: Combines keyword search (BM25) with semantic search (Sentence Transformers, CrossEncoder)
- LLM-Based Data Extraction: Uses quantized Qwen/Qwen2.5-7B-Instruct-1M for structured JSON output
- Robust Error Handling: Implements advanced mechanisms for JSONDecodeError recovery
- GPU Efficiency: Utilizes 4-bit quantization for running 7B parameter models on T4 GPUs
- Medical-Grade Accuracy: Designed for precision in clinical information extraction
π Impact:
- π Processes complex medical notes with 90%+ extraction accuracy
- β‘ Runs efficiently on consumer GPUs through quantization
- π― Extracts structured data from unstructured clinical text
π οΈ Tech Stack: Python β’ NLTK β’ BM25 β’ Sentence Transformers β’ CrossEncoder β’ Qwen LLM β’ bitsandbytes β’ PyTorch
π₯ TubeQuery
An LLM-powered tool that enables users to extract information, transcribe, and ask questions about YouTube video content, providing a seamless way to interact with video transcripts.
π― Key Highlights:
- Speech-to-Text: Utilizes OpenAI Whisper for high-quality, multilingual audio transcription
- Audio Extraction: Employs FFMPEG for efficient audio extraction from video streams
- NLP-Driven Querying: Leverages Hugging Face Transformers for natural language understanding and query resolution
- Scalable Design: Architected for multilingual transcription, advanced summarization, and AI-powered Q&A
π Impact:
- π Supports 50+ languages through Whisper
- π Generates accurate transcripts and summaries
- π― Enables semantic search across video content
π οΈ Tech Stack: Python β’ OpenAI Whisper β’ Hugging Face Transformers β’ FFMPEG β’ NLP
| Competition | Rank | Achievement |
|---|---|---|
| ML for Marine Autonomy (OCEANA IIT-Madras) | π₯ 3rd Place | Developed a CNN-based Convolutional Autoencoder for efficient underwater image transmission with 85%+ reconstruction accuracy. Optimized for bandwidth-constrained marine environments. |
| Pravartak Datathon (Research Park, IIT-Madras) | π 4th Place | Built a hypertuned regression model for US house price prediction achieving 92% MSE reduction. Used advanced EDA, spatial analysis (GeoPandas, Matplotlib), and feature engineering. |
I believe in building AI systems that are:
- π― Purpose-Driven: Every line of code should solve a real problem
- π§ Intelligently Designed: Leverage AI where it adds genuine value, not as a buzzword
- β‘ Performance-First: Optimize for speed and efficiency without sacrificing functionality
- π± User-Centric: Complex technology should feel simple and intuitive
- π Open & Collaborative: Knowledge grows when shared
"The best AI is invisibleβit just works."
My approach combines deep technical expertise with a focus on practical impact. Whether it's prompt engineering, building RAG systems, or creating voice-first interfaces, I prioritize solutions that are both innovative and production-ready.
I'm always excited to collaborate on interesting projects, discuss AI/ML innovations, or explore new opportunities!
- πΌ LinkedIn: linkedin.com/in/127001-sameer
- π§ Email: pieisnot22by7@gmail.com
- β±οΈ Response Time: Within 24 hours
- π― Open To: Hackathons β’ ML Competitions β’ Collaborative Projects β’ AI Consulting
- π Hackathons & Competitions: Love building innovative solutions under pressure
- π€ Open Source Collaborations: Interested in contributing to impactful AI/ML projects
- π‘ Freelance Opportunities: Prompt engineering, RAG systems, and AI automation
- π Knowledge Sharing: Speaking engagements, workshops, or technical writing
Built with β€οΈ by Sameer Verma | Last Updated: November 2025


