Skip to content
View ElMatiOfficial's full-sized avatar

Block or report ElMatiOfficial

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ElMatiOfficial/README.md

Hey, I'm Sebastian 👋

Senior ML Engineer based in Colombia 🇨🇴. I build production ML systems at the intersection of credit risk, recommender systems, classical ML, Agentic Infra and GenAI.

🔭 What I'm working on

  • 🏦 Credit risk ML — loss rate forecasting, PD/LGD models, cohort analysis
  • 🤖 GenAI in production — grounded RAG, hybrid retrieval, LLM evaluation, observability, agentic workflows
  • 📊 Data/ML infrastructure — dbt, BigQuery, Airflow, FastAPI, MLflow

🚀 Currently

  • Active in competitive ML — Kaggle: 1 Silver, top 20% in 8 challenges

🤖 GenAI — deeper dive

  • Retrieval — hybrid dense + BM25 with reciprocal rank fusion, cross-encoder reranking (bge)
  • Grounding & safety — citation-enforcing prompts, refusal paths, hallucination reduction
  • Evaluation — Recall@K / MRR on retrieval, faithfulness + LLM-as-judge on generation, offline eval sets wired into CI
  • Observability — Langfuse traces, token-cost monitoring, embedding-drift detection
  • Agents & tools — structured output, multi-step orchestration, guardrailed tool use

📚 Research interests

  • Credit-risk methodology — IV/WoE feature assessment, Platt calibration, SHAP-driven adverse-action reasons
  • Production RAG — hybrid retrieval benchmarks, grounding/faithfulness metrics, cost-latency trade-offs
  • LLM observability — drift detection on query embeddings, telemetry-driven prompt iteration

🛠️ Tech I reach for

Languages · Python · SQL
ML · scikit-learn · XGBoost · LightGBM · PyTorch · Hugging Face
GenAI · LangChain · RAG · sentence-transformers · ChromaDB · cross-encoder rerankers · Langfuse · OpenAI · Anthropic
Data · BigQuery · dbt · Airflow · Snowflake · Delta Lake
Infra · AWS · Docker · FastAPI · Terraform · GitHub Actions
MLOps · MLflow · Champion-Challenger · PSI drift monitoring

🎤 Beyond code

TEDx speaker · Trilingual (ES/EN/PT) · Mentored 13+ engineers

📬 Let's connect

LinkedIn · Kaggle

Pinned Loading

  1. EML-Matemathical-Translator-for-AI EML-Matemathical-Translator-for-AI Public

    Translate mathematical expressions to and from EML (Exp-Minus-Log) trees, the single-operator primitive for elementary mathematics from Odrzywolek (arXiv:2603.21852). Built for AI/ML researchers.

    Python

  2. EML-Research-POC EML-Research-POC Public

    Research POC: Odrzywolek's EML (Exp-Minus-Log) primitive wired into Claude Code as a tool-use capability. 21/21 math benchmark + 8/8 Riemann Hypothesis sub-investigation.

    Python

  3. parameter-golf parameter-golf Public

    Forked from openai/parameter-golf

    Train the smallest LM you can that fits in 16MB. Best model wins!

    Python

  4. realh realh Public

    Open-source proof-of-personhood and content-provenance tooling. Issues W3C Verifiable Credentials signed with Ed25519; verifiable offline via did:web + JWKS.

    JavaScript