Arifuzzamanjoy/README.md

Email LinkedIn Google Scholar ORCID Hugging Face Portfolio Profile Views


🧠 About Me

I'm Arifuzzaman Joy, an AI Research Scholar and Machine Learning Engineer with 6+ years of experience building production-ready AI systems. I specialize in turning theoretical AI concepts into scalable, enterprise-grade solutions, from LoRA fine-tuning for generative models to multi-agent reasoning systems deployed on cloud infrastructure.

🏆 My work LatentMAS-SLoRA is officially featured in Gen-Verse/LatentMAS, a top multi-agent reasoning framework (🤗 HuggingFace #1 Paper of the Day).

  • 🤖 Generative AI Expert: LoRA fine-tuning for image & video generation (Flux, Wan 2.2)
  • 🧠 Multi-Agent Systems: LatentMAS-SLoRA featured in Gen-Verse/LatentMAS
  • 🔬 Published Researcher: SCI journals with Impact Factors up to 7.1 (Q1)
  • 💻 Python Automation Specialist: Workflow automation, AI agents & scraping
  • ☁️ MLOps Practitioner: Docker, Kubernetes, CI/CD on Azure & AWS

💼 Experience

| Role | Organization | Period |
|------|--------------|--------|
| AI & Machine Learning Engineer (Freelance) | Upwork · Fiverr · Direct Clients | 2023 - Present |
| Research Assistant | Rajshahi University, Solar Lab / AI Lab | Mar 2022 - May 2023 |
| ERP System Setup & Data Analyst | KBEC, Dhaka | 3-month contract |

  • Freelance ML Engineer: Develops and deploys ML/AI models for multi-modal tasks, including image generation, video synthesis, NLP, and voice AI.
  • Research Assistant: Conducted research on renewable energy (solar cells) and speech processing; applied ML/DL techniques to analyze simulation data and improve photovoltaic performance.
  • ERP & Data Analyst: Implemented an Odoo ERP system for business process automation; scraped and organized contact data for niche marketing campaigns.

💪 Technical Expertise

skills = {
    "Languages":        ["Python (7+ yrs)", "SQL (5+ yrs)", "JavaScript", "HTML/CSS", "Bash"],
    "AI / ML":          ["Data Science (6+ yrs)", "Machine Learning (5+ yrs)",
                         "Deep Learning (5+ yrs)", "Agentic AI (2+ yrs)",
                         "NLP", "Computer Vision", "Generative AI"],
    "Frameworks":       ["PyTorch", "TensorFlow", "Hugging Face Transformers",
                         "Diffusers", "Langchain", "OpenCV", "Selenium",
                         "LiveKit", "Librosa", "Gradio", "PEFT"],
    "DevOps & MLOps":   ["Docker (3+ yrs)", "Kubernetes (3+ yrs)",
                         "CI/CD (3+ yrs)", "Git", "GitHub Actions"],
    "Cloud & Infra":    ["Azure (3+ yrs)", "AWS (3+ yrs)", "RunPod",
                         "MongoDB", "Firebase", "MySQL"],
    "Spoken Languages": ["English (Fluent)", "Bangla (Native)"],
}

🏆 Featured: LatentMAS-SLoRA

Officially featured in Gen-Verse/LatentMAS, a leading multi-agent reasoning framework (🤗 HuggingFace #1 Paper of the Day, arXiv:2511.20639).

Multi-agent reasoning system augmenting LatentMAS with role-specialized, dynamically switchable LoRA adapters for better specialization and adaptability. Features VLM support (Qwen2.5-VL-7B), latent-space collaboration, RAG integration, and RunPod serverless deployment.

Key Results: +12% accuracy improvement, 2.7× faster inference, 63.6% token reduction vs traditional RAG.

GitHub YouTube Demo Featured

Tech: Python PyTorch PEFT/LoRA Qwen2.5-VL RunPod Docker RAG CI/CD

Architecture:
Planner → Critic (latent)
        → Refiner (latent)
        → Judger (text)
  • Dynamic LoRA routing
  • Domain auto-detection
  • 4 specialized adapters
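
The routing idea above can be illustrated with a minimal sketch. It is not the project's actual implementation: the README does not say whether the four adapters map to agent roles or to task domains, so this example assumes domains, and the domain names, keyword lists, and the `AdapterRouter` class are all hypothetical. Detection here is plain keyword counting, standing in for whatever detector the real system uses.

```python
import re
from dataclasses import dataclass

# Stage order as listed under "Architecture" above.
STAGES = ("Planner", "Critic", "Refiner", "Judger")

# Hypothetical domains and keywords, invented for this sketch only.
DOMAIN_KEYWORDS = {
    "math": ("equation", "integral", "prove", "solve"),
    "code": ("python", "function", "bug", "compile"),
    "medical": ("patient", "symptom", "diagnosis", "dose"),
    "general": (),  # fallback adapter when no keyword matches
}


def detect_domain(query: str) -> str:
    """Pick the domain with the most keyword hits, else 'general'."""
    words = re.findall(r"[a-z]+", query.lower())
    scores = {
        domain: sum(words.count(keyword) for keyword in keywords)
        for domain, keywords in DOMAIN_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "general"


@dataclass
class AdapterRouter:
    """Tracks which adapter is active and how often it changed."""
    active: str = "general"
    switches: int = 0

    def route(self, query: str) -> list[tuple[str, str]]:
        domain = detect_domain(query)
        if domain != self.active:
            # A real system would swap LoRA weights here (with PEFT,
            # possibly via `model.set_adapter(domain)`); this sketch only
            # records the switch.
            self.active = domain
            self.switches += 1
        # Every stage of the pipeline runs with the selected adapter.
        return [(stage, domain) for stage in STAGES]


router = AdapterRouter()
for stage, adapter in router.route("Fix the bug in this Python function"):
    print(f"{stage}: adapter={adapter}")
```

Keeping the router stateful means consecutive queries from the same domain do not trigger a switch, which is one plausible reason to route dynamically rather than reload an adapter on every request.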

🔬 Research & Publications

Published in high-impact SCI/Scopus-indexed journals · Google Scholar · ORCID · ResearchGate

| # | Paper | Journal | IF | Quartile | Year |
|---|-------|---------|----|----------|------|
| 1 | Machine learning assisted revelation of the best performing single hetero-junction thermophotovoltaic cell | Sustainable Energy Technologies and Assessments | 7.1 | Q1 | 2025 |
| 2 | Machine Learning-Enabled performance exploration of AuCuSe₄ in thermophotovoltaic cell | Solar Energy | 6.0 | Q1 | 2024 |
| 3 | Numerical studies on a ternary AgInTe₂ chalcopyrite thin film solar cell | Heliyon | 4.0 | Q1 | 2023 |
| 4 | Numerical prediction on the photovoltaic performance of CZTS-based thin film solar cell | Nano Select | - | - | 2023 |
| 5 | Unleashing the Power of Open-Source Transformers in Medical Imaging | Int. J. Advanced Computer Science & Applications | 0.7 | - | 2024 |
| 6 | Spectrum estimation for voiced speech using average weighted linear prediction | - | - | - | 2024 |
| 7 | Enhancement of Bone Conducted Speech Using Deep Transfer Learning | - | - | - | 2024 |

🛠️ Projects

Self-hosted platform for hyper-realistic image generation and editing using Flux LoRA, Gradio, and Hugging Face Diffusers. Supports multi-image input, 4-bit quantization, and batch processing.

Tech: Python Flux LoRA Gradio Diffusers PyTorch CLIP CUDA

Multi-GPU pipeline for high-fidelity image-to-video, text-to-video, and speech-to-video generation. Uses Wan 2.2 with MoE architecture and Gradio UI for self-hosting.

Tech: Python PyTorch torch.distributed FSDP Docker DiT T5

🗣️ Voice-Pro: AI-Powered Speech Processing

Web app for speech recognition, translation, and voice cloning across 100+ languages. Supports YouTube processing and real-time translation.

Tech: Python Whisper WhisperX F5-TTS Edge-TTS Deep-Translator

📞 Humanoid Calling Agent Platform

Full-stack platform for natural multi-modal conversations with real-time SIP/WebRTC telephony and emotionally expressive AI voice interactions.

Tech: Python LiveKit OpenAI Gemini SIP WebRTC

Serverless worker for FLUX.2 Klein 4B text-to-image and image-to-image generation on RunPod.

Tech: Python RunPod Flux Serverless Docker

Open-source video generative models optimized for lower VRAM GPUs with web-based interface.

Tech: Python PyTorch Gradio Wan2.1

Fork of Fooocus for offline image generation with fast presets and UI enhancements.

Tech: Python Gradio Stable Diffusion

Low-cost, cloud-CPU-friendly starter kit for self-hosting AI tools with external sharing.

Tech: Docker n8n LLMs Cloud CPU

Research code for brain tumor classification and segmentation using ConvNeXt V2 and SegFormer. Achieves up to 99.6% diagnostic accuracy.

Tech: Python PyTorch Transformers ConvNeXt V2 SegFormer

AI chatbot project leveraging large language models for conversational intelligence.

Tech: Python LLMs


🎓 Education

| Degree | Institution | Year | Result |
|--------|-------------|------|--------|
| B.Sc. in Electrical & Electronic Engineering | University of Rajshahi, Bangladesh | 2017 - 2020 | CGPA 3.13 |
| Higher Secondary Certificate (H.Sc.), Science | Dhaka Education Board | 2015 - 2016 | GPA 5.00 |

📜 Certifications

  • 🏅 Deep Learning with TensorFlow (IBM)
  • 🏅 Prompt Engineering for ChatGPT (Vanderbilt University)
  • 🏅 SQL (Advanced) Certificate (HackerRank)
  • 🏅 Introduction to Programming with MATLAB (Vanderbilt University)
  • 🏅 Data, Signal, and Image Analysis with MATLAB (Coursera)

🌐 Find Me Elsewhere

GitHub Scholar HF RG MATLAB arXiv


Random Dev Quote

"The best way to predict the future is to invent it." - Alan Kay



Last updated: 2026-02-17 · Built with ❤️ by Arifuzzaman Joy
