emnlp2025

Beyond One World — A benchmark for testing how well LLMs role-play version-specific characters (e.g., superheroes across universes). Covers 30 heroes and 90 canon variants through two tasks: Canon Events (factual recall) and Moral Dilemmas (ethical reasoning). Introduces the Think-Act Matching metrices.

agent roleplay emnlp wordplay emnlp2025

Updated Oct 29, 2025
Python

ApplyU-ai / ResumeBench

Star

Beyond Human Labels: A Multi-Linguistic Auto-Generated Benchmark for Evaluating Large Language Models on Resume Parsing [EMNLP 2025 Main Conference]

resume benchmark evaluation dataset resume-parser large-language-models llm emnlp2025

Updated Nov 4, 2025

SecurityLab-UCD / FuzzAug

Star

[EMNLP'25] FuzzAug: Data Augmentation by Coverage-guided Fuzzing for Neural Test Generation

rust emnlp test-generation data-augmentation llm emnlp2025

Updated Sep 18, 2025
Python

bhimanbaghel / ResolveUnderOverEdit

Star

Official implementation of "Resolving UnderEdit & OverEdit with Iterative & Neighbor-Assisted Model Editing" (EMNLP 2025 Findings).

nlp machine-learning natural-language-processing transformers pytorch large-language-models llms model-editing knowledge-editing emnlp2025

Updated Nov 14, 2025
Python

sinaabbasi1 / NormXLogit

Star

The official repo for the EMNLP 2025 paper "NormXLogit: The Head-on-Top Never Lies"

nlp transformers interpretability explainability plausibility llm faithfulness emnlp2025

Updated Nov 5, 2025
Jupyter Notebook

Rongite / Persuasion

Star

Code & reproducibility for the EMNLP paper “Profiling LLMs’ Copyright Infringement Risks under Adversarial Persuasive Prompting”: prompts, seeds queries, and figure scripts.

nlp persuasion jailbreak copyright adversarial-attacks llm prompting emnlp2025

Updated Sep 20, 2025
Python

Ebad-urRehman / MAHED_2025_subtask1_hate_and_hope

Star

This repository contains the code and detailed analysis regarding competition and system paper I will submit regarding MAHED 2025 subtask1(hate and hope speech classification) in Arabic NLP colocated with EMNLP.

arabic-nlp hate-speech-detection acl2025 emnlp2025 mahed-2025

Updated Aug 20, 2025
Jupyter Notebook

aauss / temporal-answer-qa

Star

Time to Revisit Exact Match (Findings of EMNLP 2025)

evaluation question-answering emnlp temporal-reasoning large-language-models emnlp2025

Updated Sep 22, 2025
Python

Improve this page

Add a description, image, and links to the emnlp2025 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the emnlp2025 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

emnlp2025

Here are 14 public repositories matching this topic...

corl-team / flexsae

vivo / DiMo-GUI

shimo-lab / modelmap

parameterlab / leaky_thoughts

idramalab / quantify-llm-explanations

madhavkrishangarg / ReviewEval

Augustus2011 / Beyond_One_World

ApplyU-ai / ResumeBench

SecurityLab-UCD / FuzzAug

bhimanbaghel / ResolveUnderOverEdit

sinaabbasi1 / NormXLogit

Rongite / Persuasion

Ebad-urRehman / MAHED_2025_subtask1_hate_and_hope

aauss / temporal-answer-qa

Improve this page

Add this topic to your repo