Skip to content
View IlyasFardaouix's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report IlyasFardaouix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ilyasFardaouix/README.md

Profile Views

AI Engineer | LLM Systems | Multimodal Search | Moroccan NLP

I am Ilyas Fardaoui, an AI Engineer and ENSAM Rabat student focused on building intelligent systems that solve real-world problems. I design and ship production-oriented AI workflows, including retrieval-augmented generation pipelines, multimodal search engines, and data-intensive NLP systems. My strongest edge is working on underrepresented language spaces, especially Darija and Arabic NLP, while bridging academic ideas with practical deployment constraints. I believe in turning research-heavy ideas into practical systems that teams can use, scale, and trust.

Currently

  • 🔭 Working on: Agentic RAG pipelines with LangGraph and real-time multimodal retrieval systems
  • 🌱 Learning: Advanced agent workflows, tool-use patterns, and LLM evaluation frameworks
  • 🤝 Open to: AI engineering roles, research collaborations, and open-source contributions in Arabic/Darija NLP

Tech Stack

AI/ML Core

Python PyTorch TensorFlow scikit-learn OpenCV

LLM & Agents

LangChain LangGraph HuggingFace OpenAI

Data & Search

NumPy Pandas ChromaDB FAISS PostgreSQL

Backend & DevOps

FastAPI Flask Docker Git Linux

Featured Projects

Project Description Stack Domain Status Stars
VisualIndexer Production-grade multimodal search engine combining CLIP embeddings, OCR extraction, and vector similarity retrieval for image-text discovery. Python, CLIP, Tesseract, ChromaDB/FAISS, Streamlit Multimodal Retrieval Active Stars
darija-dataset-builder Scalable data pipeline for building high-quality Moroccan Darija corpora ready for fine-tuning and evaluation workflows. Python, NLP, HuggingFace Datasets, Pandas, MinHash Moroccan NLP Active Stars
Sepsis-Detection Early-warning ML pipeline for sepsis risk prediction using ICU clinical features and interpretable boosting models. Python, scikit-learn, XGBoost, Pandas Healthcare AI Active Stars
Reconnaissance-Faciale-Eigenfaces Classical computer vision implementation of PCA/Eigenfaces for robust face representation and recognition experiments. Python, OpenCV, NumPy, PCA Computer Vision Active Stars
GOLD-TRADING-AI AI-assisted market analysis framework for gold forecasting, signal generation, and risk-aware strategy exploration. Python, Time Series, ML, Visualization FinTech AI Active Stars
YouTube-Sentiment-Analysis End-to-end sentiment pipeline for YouTube comments including preprocessing, modeling, and actionable polarity insights. Python, NLP, scikit-learn, Pandas Social NLP Active Stars

Experience

Role Organization Period Key Achievements
AI Engineering Intern Ministry of Agriculture Morocco (MAPMDREF) Internship - Built a multimodal semantic search engine for practical retrieval use cases.
- Developed OCR pipelines for document and image text extraction.
- Improved vector-based retrieval workflows for higher relevance and speed.
Vice President Fatal Error Club - ENSAM Rabat Leadership Role - Led the club's technical direction and execution strategy.
- Mentored members across software and AI engineering tracks.
- Co-organized CURSOR Meet-up with 190+ participants and technical sessions.

What I'm Building Next

  1. Open-source Darija LLM evaluation benchmark
  2. Agentic document search assistant (RAG + tool use)
  3. Arabic multimodal dataset for visual question answering

GitHub Stats

GitHub Streak

GitHub Stats

Top Languages

Contribution Activity

Snake animation

Let's Connect

Email LinkedIn Instagram

I'm always open to interesting collaborations, AI research projects, and engineering roles. Don't hesitate to reach out!

Pinned Loading

  1. Reconnaissance-Faciale-Eigenfaces Reconnaissance-Faciale-Eigenfaces Public

    Face recognition project based on PCA/Eigenfaces with classical vision techniques.

    Python 1 1

  2. VisualIndexer VisualIndexer Public

    Multimodal visual search engine using CLIP, OCR, and vector similarity retrieval.

    Python 1

  3. Sepsis-Detection Sepsis-Detection Public

    Early sepsis risk prediction pipeline using machine learning on ICU clinical data.

    Python 1

  4. TESTCOURSE TESTCOURSE Public archive

    First test

    Jupyter Notebook 1

  5. YouTube-Sentiment-Analysis YouTube-Sentiment-Analysis Public

    Sentiment analysis pipeline for YouTube comments with end-to-end NLP workflow.

    Python 1

  6. darija-dataset-builder darija-dataset-builder Public

    Scalable pipeline for building Moroccan Darija NLP datasets for LLM training.

    Python 1