Skip to content
View dinabandhu50's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report dinabandhu50

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dinabandhu50/README.md

πŸ‘‹ Hi, I'm Dinabandhu!

πŸš€ Data Scientist | AI Researcher | Generative AI Developer

I specialize in Machine Learning, Generative AI, LLM Fine-Tuning, and Model Deployment. With over 6+ years of experience, I've worked on cutting-edge AI applications, including static code analysis improvement using LLM, drug toxicity classification using multimodal approach and Finetuning deep learning models (LLM, VLM and pretrained) for custom usecases.

πŸ’‘ I’m passionate about LLM fine-tuning, reinforcement learning, and AI-powered automation. My work involves building scalable AI applications, optimizing deep learning models, and deploying AI systems efficiently using FastAPI and Docker.


πŸ”₯ What I’m Working On

  • LLM Fine-Tuning & RAG – Experimenting with LLM finetuning, LoRA, qLoRA, and multi-GPU training
  • Reinforcement Learning & AI Agents – Implementing from scratch RLHF, PPO, DPO, ORPO, GRPO, OpenAI Agents and Google ADK
  • Investment Portfolio Optimization – Designing a long-term strategy for wealth creation

πŸ›  Tech Stack & Tools

Languages: Python (ML/DL), SQL
AI/ML Frameworks: PyTorch, Huggingface transformers, openai, TRL Library, XGBoost
Backend: FastAPI, Celery, Docker, Redis
Databases: PostgreSQL, MongoDB
Deployment: Git, GitHub Actions, Ubuntu, systemd, Nginx


πŸ“ˆ Current Learning Goals

βœ… Implementing and learning from research papers related to AI
βœ… Fine-tuning & training LLMs from scratch in a scalable way
βœ… Reinforcement Learning (RLHF, PPO, DPO, ORPO, GRPO)
βœ… Multi AI Agents and their applications
βœ… Google Cloud Platform and backend development


πŸ“¬ Let’s Connect

πŸ”— LinkedIn
πŸ’» GitHub
πŸ“§ Email: beheradinabandhu50@gmail.com


πŸ’‘ β€œAI is the new electricity.” - Andrew Ng
πŸš€ Let’s build something awesome together!

Pinned Loading

  1. credit-lead-prediction credit-lead-prediction Public

    Binary classification problem for credit lead prediction. Used XGBoost, RandomForest and CatBoost model for prediction. Optimized hyperparameter using optuna.

    Jupyter Notebook 1

  2. ecrl/padelpy ecrl/padelpy Public

    A Python wrapper for PaDEL-Descriptor software

    Python 223 40

  3. histo-cancer-image-classification histo-cancer-image-classification Public

    This is the binary image classification of histopathologic cancer detection dataset.

    Jupyter Notebook

  4. TPGSJan2022 TPGSJan2022 Public

    Kaggle tabular playground series Jan 2022

    Jupyter Notebook