
Add AUTO in PTG which dynamically allocates a suitable buffer to receive…#759

Draft
QingleiCao wants to merge 2 commits into ICLDisco:master from
QingleiCao:qinglei/variable_size_receive_buffer

Conversation

@QingleiCao
Contributor

… data

@QingleiCao QingleiCao requested a review from a team as a code owner February 23, 2026 02:21
@QingleiCao QingleiCao marked this pull request as draft February 23, 2026 02:21
Contributor

@bosilca bosilca left a comment


I understand what you are trying to do here, and I think it is a good step in the right direction. However, I have two issues with the proposed approach:

  1. It loses the type of the incoming data. You are forced to drop the type and roll back to receiving the data as packed (potentially breaking heterogeneity). I understand that the arena must be set to AUTO, but in that case I think the remote_type should be set to the expected type of the data instead of allowing it to be void*.
  2. I am really not sure about the way you compute the device_index. Without running the code, I would think you should mostly get 0, as the task's inputs are not set during the get_datatype function.

copy->device_private = NULL;
}

static void remote_dep_auto_gpu_copy_release(parsec_data_copy_t *copy, int device)

The name is confusing, as it implies you are releasing the data copy, but instead you are just releasing the private memory owned by it.

Why do you need the device? A copy is always associated with a device, and it would not make sense to release the memory of a different copy than the one you pass as argument.

output->data.remote.dst_count = remote_size;
output->data.remote.src_displ = 0;
output->data.remote.dst_displ = 0;
output->data.remote.device_index = remote_dep_auto_pick_gpu_device(newcontext);

We have a proper accessor on the task class to identify the device it wants to use. You should be calling newcontext->taskclass.data_affinity instead.

}
parsec_task_t tmp = *task;
tmp.selected_device = NULL;
if( PARSEC_SUCCESS != parsec_select_best_device(&tmp) ) {

Are you sure this returns what you expect? For me, parsec_select_best_device only works once the task is properly initialized and has all its input data copies properly set. Before that, it will basically count the possible incarnations and return the owner of any of the copies. In other words, it is mostly random, or maybe always zero if the input copies are not yet set (which is the most likely).

@QingleiCao
Contributor Author

> I understand what you are trying to do here, and I think it is a good step in the right direction. However, I have two issues with the proposed approach:
>
>   1. It loses the type of the incoming data. You are forced to drop the type and roll back to receiving the data as packed (potentially breaking heterogeneity). I understand that the arena must be set to AUTO, but in that case I think the remote_type should be set to the expected type of the data instead of allowing it to be void*.
>   2. I am really not sure about the way you compute the device_index. Without running the code, I would think you should mostly get 0, as the task's inputs are not set during the get_datatype function.

Thank you for your comments @bosilca. This is still a draft, and I’ll be making updates intermittently. 😄

