Skip to content
View adrmisty's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report adrmisty

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
adrmisty/README.md

Adriana R. Flórez

Software Engineer | NLP Specialist | Computational Linguist

I specialize in multilingual NLP techniques, and applied research in language technologies. I am a firm believer in tech for social good and ethical AI.

Experience

  • NLP Researcher Intern @ Vicomtech (Nov 2025 – Present) | Donostia, Spain
    • Neural topic modeling, RAG systems, and information extraction for EU R&D (EU-Farmbook).
    • x-AI and argument mining for medical NLP (IBERLEF2026 Shared Task 'GRACE').
  • Data Scientist Intern @ Bedrock (Mar – Jun 2024) | Gijón, Spain
    • B2B recommender systems and computer vision model optimization.
  • Front-End Developer Intern @ Transparent Edge (Jun – Sep 2023) | Remote
    • Front-end development for CDN infrastructure and API integration.

Education

  • MSc in Language and Communication Technologies (Erasmus Mundus) (2024 – 2026)
    • UPV/EHU (Spain) & Charles University (Czechia)
    • Thesis: Automated identification and adaptation prediction of loanwords in low-resource languages (Asturian, Basque, Greek).
  • BSc in Software Engineering (2020 – 2024)
    • University of Oviedo (Spain) & Virginia Tech (USA) & Haliç University (Türkiye)
    • Thesis: LLM-based automatic multilingual localization service for software applications.

Skills

  • AI & NLP: PyTorch, TensorFlow, Hugging Face, spaCy, NLTK, WhisperX, UDPipe, BERTopic, scikit-learn
  • Programming: Python, Java, C++, C#, Swift, JavaScript, HTML/CSS
  • Engineering: Git, Docker, Bash, FastAPI, Linux, MySQL, Oracle
  • Human Languages: Spanish & Asturian (Native), English (C2), Greek (B2), German (B1), French (A2), Basque (A1)

Connect

Pinned Loading

  1. tfm-low-res-lexical-borrowings tfm-low-res-lexical-borrowings Public

    Repo for the development of my Master's thesis (ÚFAL / EHU) on 'Automatic detection and prediction of lexical borrowings in low-resource languages'.

    Python 1

  2. grace-iberlef26 grace-iberlef26 Public

    GRACE 'Granular Recognition of Argumentative Clinical Evidence' shared task work for IBERLEF26, with preliminary work on the CasiMedicos-Arg dataset

    Python 1

  3. doc-quality doc-quality Public

    Project for automatic document quality validation (structural & topical relevance).

    Python 1

  4. deep-learning deep-learning Public

    NPFL138 course projects @ ÚFAL Matfyz (Fall '24), taught by Milan Straka.

    Python 1

  5. synth-qa-RAG-dataset synth-qa-RAG-dataset Public

    Generation of a silver standard Q&A dataset for RAG evaluation.

    Python 1

  6. parallel-speech-data parallel-speech-data Public

    NPFL087 Machine Translation project for achievement of Spanish-Greek parallel audio dataset, taught by Ondřej Bojar.

    Jupyter Notebook 1