Skip to content

Pull requests: NVIDIA/nvidia-resiliency-ext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Used multi_set for group rank assignment. ci-approved Approved to run CI
#252 opened Jan 28, 2026 by hexinw-nvidia Loading…
feat: NVRX Attribution Service and NVRX Slurm Monitor Service ci-approved Approved to run CI
#248 opened Jan 20, 2026 by namitdhameja Loading…
Helisha/attrsvc combined pr
#245 opened Jan 15, 2026 by helisha91 Loading…
Haim updates new version of logsage and nvdataflow
#243 opened Jan 11, 2026 by helisha91 Loading…
Include Source Git Hash in NVRx Installation ci-approved Approved to run CI
#233 opened Dec 10, 2025 by continue-revolution Loading…
InJob: Include Source Git Hash in NVRx Installation ci-approved Approved to run CI
#229 opened Dec 8, 2025 by continue-revolution Loading…
Infra HC service over UDS ci-approved Approved to run CI
#227 opened Dec 6, 2025 by namitdhameja Loading…
feat: add non-retryable exception pattern matching
#212 opened Oct 28, 2025 by hexinw-nvidia Loading…
CAS profiling
#188 opened Sep 21, 2025 by hexinw-nvidia Draft
Auto restart ci-approved Approved to run CI
#139 opened Aug 6, 2025 by hexinw-nvidia Draft
Add example for multimodal models ci-approved Approved to run CI
#131 opened Jul 25, 2025 by Ava-A4098 Loading…
Added in-process wrapper restart latency
#118 opened Jul 13, 2025 by namitdhameja Loading…
Test UT. ci-approved Approved to run CI
#79 opened May 17, 2025 by hexinw-nvidia Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.