Conversation

EMZEDI (Collaborator) commented Apr 30, 2025

This PR only adds a few new scripts to run eval with vLLM.

EMZEDI requested review from Jacob-Chmura and Copilot on April 30, 2025, 20:03

Copilot AI left a comment

Pull Request Overview

This PR scales evaluations by updating dependencies and modifying the deepspeed accelerate configuration for vllm-based eval scripts.

  • Added additional dependencies (setuptools, ipykernel, matplotlib) in pyproject.toml.
  • Increased num_processes and added main_process_port in the deepspeed configuration file.
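
For reference, the `num_processes` and `main_process_port` keys in an accelerate config file have matching `accelerate launch` command-line flags. The following is only a minimal sketch of how those two settings could be passed at launch time. The numeric values and the `eval.py` script name are hypothetical placeholders, since the values actually used in this PR are not shown in the conversation.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical values; the PR excerpt does not state the actual numbers.
NUM_PROCESSES=4
MAIN_PROCESS_PORT=29501

# Build the launch command as an array so it can be inspected before running.
build_launch_cmd() {
  local script="$1"
  cmd=(
    accelerate launch
    --config_file benchmarks/ppo/accelerate_configs/deepspeed_zero2.yaml
    --num_processes "${NUM_PROCESSES}"
    --main_process_port "${MAIN_PROCESS_PORT}"
    "${script}"
  )
}

# eval.py is a placeholder script name, not a file from this PR.
build_launch_cmd eval.py

# Dry run: print the command instead of executing it.
printf '%s ' "${cmd[@]}"; echo
# To actually launch, run: "${cmd[@]}"
```

Giving each job its own `main_process_port` is one way to avoid port collisions when several launches share a node, which may be why the setting was added here.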

Reviewed Changes

Copilot reviewed 3 out of 5 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| pyproject.toml | Added dependencies to support additional functionality. |
| benchmarks/ppo/accelerate_configs/deepspeed_zero2.yaml | Updated parallel process count and added a process port setting. |
Files not reviewed (2)
  • jobs/validate_all_static.sh: Language not supported
  • jobs/validate_all_static_diversity.sh: Language not supported


# list all sub‐tasks
tasks=(
ultra-hh-sampled
Member

Why do we no longer need the full list of datasets that was here before? The loop is much better; I'm just trying to understand the reason for the change.
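
For context, the array-plus-loop pattern being discussed can be sketched roughly as below. This is not the PR's actual script: only `ultra-hh-sampled` appears in the excerpt above, so `another-task` and the `run_eval` function are hypothetical stand-ins for the remaining sub-tasks and the real evaluation command.

```shell
#!/usr/bin/env bash
set -euo pipefail

# List all sub-tasks. Only ultra-hh-sampled is taken from the PR excerpt;
# "another-task" is a hypothetical placeholder.
tasks=(
  ultra-hh-sampled
  another-task
)

# Hypothetical stand-in for the real evaluation command.
run_eval() {
  local task="$1"
  echo "evaluating ${task}"
}

# Run the evaluation once per entry in the tasks array.
run_all() {
  local task
  for task in "${tasks[@]}"; do
    run_eval "${task}"
  done
}

run_all
```

With this layout, adding or dropping a dataset means editing one array entry rather than duplicating a full command per dataset.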

Member

Should these changes be propagated to the other benchmark deepspeed configs?

Member

Are the dependency changes temporary?
