[Q&A] Real-world failure: RuntimeError: Failed to get engine\n The client could not register to server. #2589
Replies: 2 comments 3 replies
-
|
Beta Was this translation helpful? Give feedback.
-
@Ultimate-Storm thanks for sharing your issue and logs. You mentioned you have set up provision with this new project.yaml, can you share the logs (server, both clients) as well? Thanks! |
Beta Was this translation helpful? Give feedback.
-
Python version (
python3 -V
)3.10
NVFlare version (
python3 -m pip list | grep "nvflare"
)2.4.0
NVFlare branch (if running examples, please use the branch that corresponds to the NVFlare version,
git branch
)2.4.0
Operating system
Ubuntu 22.04.4 LTS
Have you successfully run any of the following examples?
Please describe your question
I run into the following error when trying to connect a remote client to a server. I have the setup with two separate machines connected within lab network. Provision is configured through Dashboard. Two clients have been registered. Project is running under conda environment and I have ensured that they are identical across the two machines. I checked I have no problem to access dl3.example.com on port 8002 and 8003 from 'dl0'
I have already tried POC on a single machine to host the server, admin, and two clients locally and everything is running fine. But now I have machine 'dl3' hosting server and one client, 'dl0' hosting another client. In this case I failed to connect the client from 'dl0' to the server. Here is the error log from 'dl0'
Log from 'dl3' server:
The above 'Total clients: 2' is because I tried to start the second local client on 'dl3' instead of the remote one just for testing.
Log from 'dl3' client:
Any help is appreciated!
Beta Was this translation helpful? Give feedback.
All reactions