Multi rdma device not working in rdma brpc branch #1401
Comments
#1183
brpc has no plan to support multiple RDMA devices.
@wwbmmm
According to the experimental results, brpc doesn't support multiple RDMA devices, so the device to use needs to be specified with an option. brpc supports RoCE LAG devices, right?
@changchengx
@Tuvie
We have multiple RDMA NICs in our server. If we use the first RDMA device with --rdma_device=mlx5_0, the brpc client and server work well. But if we use another RDMA device, such as --rdma_device=mlx5_1, the client cannot communicate with the server. It shows the following error:
rdma_create_qp fails with errno=22 (invalid parameter). The root cause is that GetRdmaProtectionDomain() uses the global context. Does the global context not support this case well, or does it have a bug? I wrote a custom client that uses rdma_cm_id->verbs as the context, and with it I can connect to the brpc server through the other RDMA device.
But in rdma_helper.cpp the PD is initialized before the rdma_cm_id is created.
So my questions are: does anyone else have the same issue? And how should the bug be fixed: do we need to create the PD after the rdma_cm_id is created?
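To make the idea concrete, here is a minimal sketch of "create the PD after the rdma_cm_id is known". It is only a model: FakeContext, FakePd, FakeCmId, FakeCreateQp and PdCache are hypothetical stand-ins, not the real ibv_context, ibv_pd, rdma_cm_id or brpc code. It assumes the errno=22 comes from the PD belonging to a different device context than the one in rdma_cm_id->verbs, which I have not confirmed in brpc itself.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <map>
#include <memory>
#include <string>

// Hypothetical stand-ins for ibv_context, ibv_pd and rdma_cm_id.
struct FakeContext { std::string device_name; };
struct FakePd { const FakeContext* context; };   // a PD is tied to one device context
struct FakeCmId { const FakeContext* verbs; };   // context the connection resolved to

// Models the assumed check: the PD must come from the same context as the cm_id.
// Returns 0 on success, EINVAL (22) on a mismatch.
int FakeCreateQp(const FakeCmId& id, const FakePd* pd) {
    if (pd == nullptr || pd->context != id.verbs) return EINVAL;
    return 0;
}

// Per-device PD cache: a PD is allocated lazily for whichever context the
// cm_id reports, instead of one global PD created before any cm_id exists.
class PdCache {
public:
    const FakePd* Get(const FakeContext* ctx) {
        auto it = pds_.find(ctx);
        if (it == pds_.end()) {
            it = pds_.emplace(ctx, std::make_unique<FakePd>(FakePd{ctx})).first;
        }
        return it->second.get();
    }
    std::size_t size() const { return pds_.size(); }
private:
    std::map<const FakeContext*, std::unique_ptr<FakePd>> pds_;
};

// Walks through the scenario from this report with two modeled devices.
void Demo() {
    FakeContext mlx5_0{"mlx5_0"}, mlx5_1{"mlx5_1"};
    FakePd global_pd{&mlx5_0};   // PD created up front on the first device
    FakeCmId id{&mlx5_1};        // connection resolved to the second device
    assert(FakeCreateQp(id, &global_pd) == EINVAL);      // mismatch is rejected
    PdCache cache;
    assert(FakeCreateQp(id, cache.Get(id.verbs)) == 0);  // matching PD works
}
```

With real verbs the cache lookup would presumably be replaced by allocating the PD from rdma_cm_id->verbs, as in my custom client. Memory registration would also need to be per PD, which this sketch ignores.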