
pml/cm,ob1: pack data from application buffer in successive MPI_Start calls #12797

Merged: 2 commits, Sep 20, 2024

wenduwan (Contributor) commented Sep 5, 2024

This patch fixes a bug exposed by MTT MPI_Bsend_init_overtake_c, where

  • MPI_Bsend_init(buf, ..., req)
  • MPI_Start(req)
  • MPI_Wait(req, stat)
  • modify(buf)
  • MPI_Start(req) // this does not send the updated data
  • MPI_Wait(req, stat)

This is because PML/CM packs data from the application buffer into a bounce buffer, send_req.req_buff, in MPI_Bsend_init but not in MPI_Start.

This patch adds a step to pack data from the application buffer in each MPI_Start call.

Signed-off-by: Wenduo Wang <wenduwan@amazon.com>
Signed-off-by: George Bosilca <gbosilca@nvidia.com>
wenduwan changed the title from "pml/cm: pack data from application buffer in successive MPI_Start calls" to "pml/cm,ob1: pack data from application buffer in successive MPI_Start calls" on Sep 10, 2024
janjust (Contributor) commented Sep 10, 2024

@bosilca ready to go?

bosilca (Member) commented Sep 11, 2024

I should not review my own code. Someone else needs to do it. @hppritcha, maybe?

wenduwan removed the request for review from bosilca on September 11, 2024, 18:15
wenduwan (Contributor, Author) commented

Howard should be back from vacation next week or so.

hjelmn (Member) left a comment

LGTM

bosilca merged commit e4b98d7 into open-mpi:main on Sep 20, 2024
17 checks passed