Full refactor of GuideLLM #351

markurtz · 2025-09-19T03:21:52Z

Summary

TODO

Details

TODO

Test Plan

TODO

Related Issues

TODO

## Summary

TODO

---

- [x] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary

This PR ports the new functionality from `benchmark run` to `benchmark from-file`, reusing as much code as practical so that there is one source of truth.

## Details

- Fixes `from-file` by making it use the new output format.
- Moves code related to the new output formats into separate functions that are called from both benchmark entrypoints.
- Moves additional chunks of code out of the large `benchmark run` entrypoint function for modularity.

## Test Plan

Run a benchmark with an output of json or yaml, then use `from-file` to re-import and export it. You can select any output type supported by `benchmark run`.

`guidellm benchmark from-file ./result.json --output-formats console`

`guidellm benchmark from-file ./result.yaml --output-formats yaml`

## Related Issues

---

- [x] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [x] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

---------

Signed-off-by: Jared O'Connell <joconnel@redhat.com>
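The idea of one output helper shared by both entrypoints can be sketched as follows. This is a minimal illustration, not GuideLLM's code: `FORMATTERS`, `write_outputs`, `benchmark_run`, and `benchmark_from_file` are hypothetical names, and the report is a plain dictionary standing in for the real report object.

```python
import json
from typing import Callable


def to_json(report: dict) -> str:
    """Serialize a report as deterministic JSON."""
    return json.dumps(report, indent=2, sort_keys=True)


def to_console(report: dict) -> str:
    """Render a report as simple 'key: value' lines."""
    return "\n".join(f"{key}: {value}" for key, value in sorted(report.items()))


# Hypothetical registry: output-format name -> formatter function.
FORMATTERS: dict[str, Callable[[dict], str]] = {
    "json": to_json,
    "console": to_console,
}


def write_outputs(report: dict, output_formats: list[str]) -> dict[str, str]:
    """Shared helper: render the report once per requested format."""
    unknown = [name for name in output_formats if name not in FORMATTERS]
    if unknown:
        raise ValueError(f"Unsupported output formats: {unknown}")
    return {name: FORMATTERS[name](report) for name in output_formats}


def benchmark_run(output_formats: list[str]) -> dict[str, str]:
    """Stand-in for the 'run' entrypoint: produce a report, then export it."""
    report = {"requests": 20, "profile": "synchronous"}  # placeholder results
    return write_outputs(report, output_formats)


def benchmark_from_file(serialized: str, output_formats: list[str]) -> dict[str, str]:
    """Stand-in for the 'from-file' entrypoint: re-import, then export."""
    report = json.loads(serialized)
    return write_outputs(report, output_formats)
```

Because both entrypoints delegate to the same helper, a format added to the registry becomes available to both at once.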

…s loading (#413)

## Summary

Propagate dataset loading errors other than not-found errors, so that the user is told the dataset configuration is incorrect and how to fix it.

## Details

- [ ]

## Test Plan

-

## Related Issues

- Resolves #

---

- [ ] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)
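One way to read the behavior described above is a loader chain that treats "not found" as "try the next loader" and lets every other error surface. The sketch below assumes that reading; `DatasetNotFoundError` and `load_dataset_with_fallback` are hypothetical names, not the actual GuideLLM or HuggingFace `datasets` API.

```python
class DatasetNotFoundError(Exception):
    """Hypothetical signal that a loader does not recognize the source."""


def load_dataset_with_fallback(source, loaders):
    """Try each loader in order.

    A DatasetNotFoundError means "this loader does not apply", so the next
    loader is tried. Any other exception (bad split name, invalid config,
    and so on) is deliberately not caught, so it reaches the user.
    """
    for loader in loaders:
        try:
            return loader(source)
        except DatasetNotFoundError:
            continue  # this loader does not apply; try the next one
    raise DatasetNotFoundError(f"No loader could handle {source!r}")
```

The design choice is that a configuration mistake is never masked by a later, more generic "dataset not found" message.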

…audio flows (#412)

## Summary

Rework the scheduling strategies and how the workers and worker group use them, both to simplify the code and to ensure race conditions cannot occur between worker processes when grabbing timings. Additionally, fix stats accumulation bugs and data loading bugs that were causing audio flows to fail.

## Details

- [ ]

## Test Plan

-

## Related Issues

- Resolves #

---

- [ ] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

… package and CLI pathways (#414)

## Summary

Changes the benchmarking entrypoint to take an Args object, which is now also used to load scenarios. This provides a single source of truth and allows the exact configuration to be saved in the report output.

## Details

- [ ]

## Test Plan

-

## Related Issues

- Resolves #

---

- [ ] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)
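A rough sketch of the "single args object" pattern follows. It uses a standard-library dataclass purely for illustration; `BenchmarkArgs`, its fields, and `run_benchmark` are hypothetical and do not reflect the actual class or entrypoint in GuideLLM.

```python
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass(frozen=True)
class BenchmarkArgs:
    """Hypothetical container holding every option for one benchmark run."""

    target: str
    profile: str = "synchronous"
    max_requests: Optional[int] = None
    data_args: dict = field(default_factory=dict)


def run_benchmark(args: BenchmarkArgs) -> dict:
    """Stand-in entrypoint: takes the args object and embeds it in the report."""
    results = {"completed": args.max_requests or 0}  # placeholder results
    # Saving the args next to the results makes the run reproducible from
    # the report alone.
    return {"args": asdict(args), "results": results}
```

A usage example, under the same assumptions: `report = run_benchmark(BenchmarkArgs(target="http://localhost:8000", max_requests=20))`, after which `BenchmarkArgs(**report["args"])` reconstructs the original configuration.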

…odec (#411)

## TODO

- [ ] ~~More flexible version locking in multimodal extras group~~
  - Goal with this was to add locking for different torchcodec/torch versions, but honestly it's not worth the hassle
- [x] Check for multi-modal libs being installed
- [ ] More testing on `encode_audio`

## Summary

Replaces audio processing libraries with `torchcodec`, which eliminates 19 dependencies and brings us in line with what HuggingFace `datasets` is doing.

## Details

-

## Test Plan

- Run against audio server with

```bash
guidellm benchmark run \
  --target http://localhost:8000 \
  --profile "synchronous" \
  --max-requests 20 \
  --request-type "audio_transcriptions" \
  --data "openslr/librispeech_asr" \
  --data-args '{"name": "clean", "split": "test"}'
```

---

- [x] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [x] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary

Adds a `tox` env for updating the lock file. Also allows args for mypy env.

---

- [x] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [ ] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary

Various type fixes with the goal of not breaking anything.

---

- [x] "I certify that all code in this PR is my own, except as noted below."

## Use of AI

- [x] Includes AI-assisted code completion
- [ ] Includes code generated by an AI application
- [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

Base version update to what is pushed as latest to enable PR for base…

a2d19cd

… refactor branch Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

markurtz force-pushed the features/refactor/base branch from ebdb6ce to a2d19cd Compare September 19, 2025 03:22

markurtz and others added 19 commits September 19, 2025 03:50

Base version update to what is pushed as latest to enable PR for base…

cd5a92d

… refactor branch Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

core changes for refactor including pyproject.toml updates and renami…

8d6e19a

…ng config.py to settings.py due to later config additions and potential conflicts in naming Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

remove improper readdition of pyhumps

669848d

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

refactors for the utility modules

6b6ed98

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Remove old pydantic file that is now replaced

d15cf17

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

fixes from copilot review

5b83c2d

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

add refactored scheduler package and tests

c84299b

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Standardize on plural for modules/packages and update from copilot re…

a7ae737

…view Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

backend refactor implementations

02554b0

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

fixes from copilot review and standardize backend package to backends…

a88605e

… for plural Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

remove renaming changes from benchmark package til after that PR is u…

452eb65

…p to avoid conflicts Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Add in benchmark package refactor

7829fb8

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

fixes and rebase

4834767

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

fixes from copilot review

61736f5

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Mock server implementation for guidellm

a28bbe3

fixes from copilot review

bb98193

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Any missing changes / working state for refactor

a9a082a

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

add in the perf extras

6d0d4c2

Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>

Complete CSV output

bfc8e50

Signed-off-by: jaredoconnell <joconnel@redhat.com>

sjmonson mentioned this pull request Sep 23, 2025

Fix start_token is not correct #361

Open

4 tasks

sjmonson and others added 8 commits September 24, 2025 12:57

[GuideLLM Refactor] Entrypoint: Reintroduce changes from main (#363)

78615f7

## Summary

Reintroduces a few changes from main

---------

Signed-off-by: Samuel Monson <smonson@redhat.com>

Update GenerativeTextScenario to match current def

3ac1537

Replace scenario entrypoint with a decorator
Forward-port get_default and from_file to Scenario
Apply scenario args as an update to kwargs
Readd scenario support to CLI

Signed-off-by: Samuel Monson <smonson@redhat.com>
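The decorator approach named in this commit message might look roughly like the sketch below. It is an illustration only: `with_scenario`, the `scenario` keyword, and the `benchmark` function are hypothetical, and the precedence shown (scenario values overwrite explicitly passed keyword arguments) is an assumption based on a literal reading of "apply scenario args as an update to kwargs".

```python
import functools


def with_scenario(func):
    """Hypothetical decorator: merge scenario values into keyword arguments."""

    @functools.wraps(func)
    def wrapper(*args, scenario=None, **kwargs):
        if scenario is not None:
            # Assumption: scenario values are applied as a dict update,
            # so they overwrite any matching keyword arguments.
            kwargs.update(scenario)
        return func(*args, **kwargs)

    return wrapper


@with_scenario
def benchmark(target, profile="synchronous", rate=None):
    """Stand-in entrypoint that simply echoes its resolved options."""
    return {"target": target, "profile": profile, "rate": rate}
```

With this shape, the entrypoint itself stays unaware of scenarios, and the same decorator can wrap any function that accepts the scenario's keys as keyword arguments.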

Add workaround for pydantic/pydantic#9541

c47a1f6

Signed-off-by: Samuel Monson <smonson@redhat.com>

Rename rate_type -> profile in builtin scenarios

965aca2

Signed-off-by: Samuel Monson <smonson@redhat.com>

Always parse rate as list[float]

d9a4df2

Signed-off-by: Samuel Monson <smonson@redhat.com>
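As an illustration of normalizing a rate option to `list[float]`, here is a minimal sketch. `parse_rate` is a hypothetical helper, and the accepted input shapes (a single number, a comma-separated string, or a sequence) are assumptions rather than the behavior of the actual commit.

```python
from typing import Optional


def parse_rate(value) -> Optional[list[float]]:
    """Normalize a rate option so downstream code always sees list[float]."""
    if value is None:
        return None
    if isinstance(value, str):
        # Assumed CLI form: "1,2.5" (empty segments are ignored).
        parts = [part.strip() for part in value.split(",") if part.strip()]
        return [float(part) for part in parts]
    if isinstance(value, (int, float)):
        return [float(value)]
    # Any other iterable of numbers or numeric strings.
    return [float(item) for item in value]
```

Normalizing once at the boundary means a single rate and a sweep of rates can share the same code path afterwards.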

Fix bug where empty constraints in sweep caused error

03f9085

Signed-off-by: Jared O'Connell <joconnel@redhat.com>

markurtz and others added 24 commits October 15, 2025 13:15

Propagate valid failures from HuggingFace datasets loading (ones that…

91f79b7

… are not related to not found errors) for better messaging to the user

Fixes from review

5f4a731

Merge branch 'features/refactor/constant_rate_fixes' into features/re…

eb84935

…factor/propagate_deserializer_errors

Replace pydub, librosa, and soundfile with torchcodec

a401165

Signed-off-by: Samuel Monson <smonson@redhat.com>

Add all group for extras

23d65ed

Signed-off-by: Samuel Monson <smonson@redhat.com>

Fix lock

5a768f8

Signed-off-by: Samuel Monson <smonson@redhat.com>

Rewrite encode_audio to use torchcodec

ec7071b

Signed-off-by: Samuel Monson <smonson@redhat.com>

Dump raw bytes not tensor

aee230c

Signed-off-by: Samuel Monson <smonson@redhat.com>

Code pathway cleanup

c1340b4

Signed-off-by: Samuel Monson <smonson@redhat.com>

Defer multimodal imports

c8e9ff9

Signed-off-by: Samuel Monson <smonson@redhat.com>
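Deferring an optional import generally means importing the library inside the code path that needs it, and failing with an actionable message if it is missing. The sketch below shows that general pattern only; `require_optional` and the extras name passed to it are hypothetical and are not taken from the GuideLLM code.

```python
import importlib


def require_optional(module_name: str, extra: str):
    """Import an optional dependency at call time instead of at module import.

    `extra` is the (assumed) name of the optional dependency group to
    mention in the error message.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as err:
        raise ImportError(
            f"'{module_name}' is required for this feature; "
            f"install the optional '{extra}' dependencies."
        ) from err
```

A function that handles audio could call `require_optional("torchcodec", extra="multimodal")` on first use, so users who only run text benchmarks never need the library installed.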

Apply copilot fixes

6d036f8

Signed-off-by: Samuel Monson <smonson@redhat.com>

Bump torchcodec version

cf5a2e3

Signed-off-by: Samuel Monson <smonson@redhat.com>

Add tox lockfile updater

ad192cb

Signed-off-by: Samuel Monson <smonson@redhat.com>

Allow arguments for tox type checks

b65c6ab

Signed-off-by: Samuel Monson <smonson@redhat.com>

Fix mock server type errors

5b38f40

Signed-off-by: Jared O'Connell <joconnel@redhat.com>

More type fixes

cb36c6a

Signed-off-by: Jared O'Connell <joconnel@redhat.com>

Address utility and presentation type errors

1ffe53a

Signed-off-by: Jared O'Connell <joconnel@redhat.com>

Fix type errors in extras

48769c2

Signed-off-by: Jared O'Connell <joconnel@redhat.com>

sjmonson marked this pull request as ready for review October 17, 2025 20:47

sjmonson force-pushed the features/refactor/base branch from 84bc3d8 to fbd417f Compare October 17, 2025 20:48

Merge branch 'main' into features/refactor/base

f9af34d

sjmonson changed the title ~~[GuideLLM Refactor] Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main~~ Full refactor of GuideLLM Oct 17, 2025

sjmonson merged commit e787cc1 into main Oct 17, 2025
6 of 16 checks passed

sjmonson deleted the features/refactor/base branch October 17, 2025 21:08

4 participants
