feat: Adding nixl read() multimodal support for vLLM backend #4271
base: alexandrem/frontend-image-decoding-nixl
Conversation
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
Signed-off-by: KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com>
```python
async def _read_decoded_image_via_nixl(
    self, decoded_meta: Dict[str, Any]
) -> PIL.Image.Image:
    """Read decoded image via NIXL RDMA and convert to PIL.Image."""
    # Lazy-init connector
    if self._connector is None:
        self._connector = connect.Connector()
        await self._connector.initialize()
        logger.info("NIXL connector initialized for decoded media")

    # Extract fields
    meta_str = decoded_meta["nixl_metadata"]
    desc = decoded_meta["nixl_descriptor"]
    shape = decoded_meta["shape"]

    # Create tensor to receive RDMA data
    tensor = torch.empty(shape, dtype=torch.uint8)

    # Build RdmaMetadata from frontend-provided descriptor
    # Frontend sends compressed metadata (matches Python nixl_connect)
    rdma_meta = RdmaMetadata(
        descriptors=[
            SerializedDescriptor(
                device="cpu"
                if desc.get("mem_type") == "Dram"
                else f"cuda:{desc.get('device_id', 0)}",
                ptr=desc["addr"],
                size=desc["size"],
            )
        ],
        nixl_metadata=meta_str,
        notification_key=f"img-{shape}",
        operation_kind=int(OperationKind.READ),
    )

    # RDMA read
    read_op = await self._connector.begin_read(
        rdma_meta, connect.Descriptor(tensor)
    )
    await read_op.wait_for_completion()
```
Not a NIXL expert, so please let me know if I can be doing anything here better.
```rust
// Compress metadata before base64 encoding (matches Python nixl_connect behavior)
// Backend expects: b64:<base64_of_compressed_bytes>
let mut encoder = ZlibEncoder::new(Vec::new(), Compression::new(6));
encoder.write_all(&nixl_md)?;
let compressed = encoder.finish()?;
```
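For reference, the frontend's compress-then-encode step can be mirrored in Python with only the standard library. This is a sketch, not code from the PR: the function name encode_nixl_metadata is made up here, and level 6 is assumed to match Compression::new(6) in the Rust snippet.

```python
import base64
import zlib


def encode_nixl_metadata(nixl_md: bytes) -> str:
    # Compress with zlib at level 6 (same level as the Rust ZlibEncoder),
    # then base64-encode and add the "b64:" prefix the backend expects.
    compressed = zlib.compress(nixl_md, level=6)
    return "b64:" + base64.b64encode(compressed).decode("ascii")
```

The round trip is symmetric: stripping the prefix, base64-decoding, and zlib-decompressing recovers the original metadata bytes.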
Once again, welcome any suggestions on correct nixl usage.
Open Question for Testing: Ideally, we would like to test both cases. Based on my conversation with @nv-tusharma, IIUC they suggested creating a separate workflow outside
1. Url: Frontend passes URL, backend decodes
2. Decoded: Frontend decoded, NIXL RDMA transfer
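A minimal sketch of how a backend could distinguish the two paths at request time. The function name and dict keys here are illustrative, not the PR's actual handler code; the "nixl_metadata" key is taken from the Python handler above.

```python
from typing import Any, Dict


def media_path(item: Dict[str, Any]) -> str:
    """Pick the workflow for one media item (illustrative keys)."""
    if "nixl_metadata" in item:
        # (2) Decoded: the frontend already decoded the image;
        # the backend RDMA-reads the raw tensor via NIXL.
        return "decoded"
    # (1) Url: the frontend forwarded the URL; the backend fetches and decodes.
    return "url"
```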
How does a user control which one happens (1) or (2)?
Currently, opting in or out is determined by which flags are included at build time (--features media-nixl). Do you have a better workflow in mind? Might be worth mentioning on #3988 as well.
I think it would be worthwhile to have an argument at startup time for this, which would be provided to the frontend and workers.
After reading @milesial's doc here: https://github.com/ai-dynamo/dynamo/pull/3988/files?short_path=c817023#diff-c817023e4a07199f620dfc8dbf04021b0edc558d6b30b7e8bbb089615dc040ec
It sounds to me like passing media_decoder and media_fetcher to register_llm enables the feature / hints the frontend to do the decoding when available. Please read up on that part and see whether that approach makes sense to you, @indrajit96.
```rust
let b64_encoded = general_purpose::STANDARD.encode(&nixl_md);
// Compress metadata before base64 encoding (matches Python nixl_connect behavior)
// Backend expects: b64:<base64_of_compressed_bytes>
let mut encoder = ZlibEncoder::new(Vec::new(), Compression::new(6));
```
NIT: rename encoder to zlib_encoder; a bare "encoder" could be confused with the Encoder in E->P->D.
```rust
let b64_encoded = general_purpose::STANDARD.encode(&nixl_md);
// Compress metadata before base64 encoding (matches Python nixl_connect behavior)
// Backend expects: b64:<base64_of_compressed_bytes>
let mut encoder = ZlibEncoder::new(Vec::new(), Compression::new(6));
```
Just for my understanding: I'm curious, don't you need to decompress this on the worker?
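If decompression is needed on the worker, it would be the inverse of the frontend step. A stdlib sketch, assuming the b64:<base64_of_compressed_bytes> format described in the Rust comments; decode_nixl_metadata is a hypothetical helper, not a function from this PR.

```python
import base64
import zlib


def decode_nixl_metadata(meta_str: str) -> bytes:
    # Inverse of the frontend: strip the "b64:" prefix, base64-decode,
    # then zlib-decompress to recover the raw NIXL agent metadata.
    if not meta_str.startswith("b64:"):
        raise ValueError("expected 'b64:' prefix")
    return zlib.decompress(base64.b64decode(meta_str[len("b64:"):]))
```

If the NIXL connector already accepts the compressed form (as the Python nixl_connect comment suggests), this step may be handled inside the library rather than in handler code.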
Overview:
With #3988, we have functional image decoding in the frontend for any b64 or HTTP URLs passed with the inference request. This PR builds on top of #3988 and implements the nixl read() portion of the image decoding workflow for the backend.

Details:
Look at handlers.py for the additions to the DECODED workflow.