Skip to content

feat: add OLMES variant of HumanEval task#185

Open
tfburns wants to merge 4 commits intomainfrom
human_eval
Open

feat: add OLMES variant of HumanEval task#185
tfburns wants to merge 4 commits intomainfrom
human_eval

Conversation

@tfburns
Copy link
Collaborator

@tfburns tfburns commented Feb 26, 2026

PR Checklist

  • Use descriptive commit messages.
  • Provide tests for your changes.
  • Update any related documentation and include any relevant screenshots.
  • Check if changes need to be made to docs (README or any guides in /docs/).

What type of PR is this? (check all applicable)

  • Refactor
  • Feature
  • Bug Fix
  • Optimization
  • Documentation Update

Description

Adds the OLMES variant of the HumanEval

Added/updated tests?

  • Yes
  • No, and this is why: please replace this line with details on why tests
    have not been included
  • I need help with writing tests

@tfburns tfburns marked this pull request as ready for review February 26, 2026 14:41
class HumanEval_OLMES(HumanEval):
"""HumanEval OLMES variant replicating codex_humaneval:3shot::olmo3:n32:v2 from oe_eval.

Recommended EvalConfig settings for full replication:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not related to the PR -- is there any way we can enforce these automatically?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants