Closed
Timeline
Release Date: Nov. 24th, 2023
Work Items
Communication Features and Interfaces
Platform support
- ROCm: add ROCm support (ROCm support #213)
GPU-side communication interfaces
- Interfaces: add tile-based communication interfaces (Add tile-based API #145)
Host-side interfaces
- Connection: async connection setup (Make setup async to get rid of setup() #167)
- SmChannel: improve constructors ([feature] sm_channel constructors need to be improved. #148)
- Python: Python binding for FIFO ([bug] the fifo from proxy cannot be bound bynb to python #153)
- Python: Python binding for DeviceSyncer ([feature] python binding for DeviceSyncer #156)
- ProxyChannel: warning ahead of IB CQ being full (Warning ahead of CQ being full #202)
Transports support
- IB: add IB gather (IB gather WIP #172)
Development Pipeline
- CI: add code coverage (Add code coverage report #208)
- Docs: update performance numbers in the README (Update documentation #217)
- Python Benchmark: add Python-based mscclpp-test ([Feature] Support python-based mscclpp-test #187, mscclpp-test in Python #204)
- Python Benchmark: robust correctness test (Robust correctness test #221)
- Docker: update Dockerfiles (Templatize Dockerfiles & update workflows #223)
Bug Fix
- Semaphore: use atomic (atomic for semaphores instead of fences #188)
- Python: GIL release (Release GIL for Python APIs with wait #190)
- CMake: check nvidia_peermem module during build (Explicit build dependency on nvidia_peermem #201)
- mscclpp-test: AllReduce bug fix (Fix allreduce bug #197, Fixing the bug in allreduce1 #220)
- Interfaces: allow infinite waiting (Allow infinite waiting #200)
- Interfaces: fix a DeviceSyncer bug (Fix DeviceSyncer #222)