Skip to content

fix: add semantic_match scorer to detect drift in math_eval#69

Draft
Yuu6798 wants to merge 27 commits intoopenai:mainfrom
Yuu6798:main
Draft

fix: add semantic_match scorer to detect drift in math_eval#69
Yuu6798 wants to merge 27 commits intoopenai:mainfrom
Yuu6798:main

Conversation

@Yuu6798
Copy link

@Yuu6798 Yuu6798 commented May 4, 2025

Replaces check_equality with semantic_match scorer using sentence-transformers.
This reduces false positives due to semantic drift (>0.2) even when answers match approximately.

@Yuu6798 Yuu6798 marked this pull request as draft May 8, 2025 01:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant