Replies: 2 comments 9 replies
-
The |
Beta Was this translation helpful? Give feedback.
-
Hi @holgerroth, @Can-Zhao and @ZiyueXu77, Let's assume 3 of my 9 clients share the noisy gradients of one specific parameter (after SVT). Are these gradients going to be averaged by 9 clients or by 3 clients? It seems like there is only one global weighting for the parameters available, which may lead to much more noisy avg gradients than originally intended by SVT. |
Beta Was this translation helpful? Give feedback.
-
Hi all,
I added SVT to my current FL Training round, but it lowers the performance tremendously. I am aware, that DP will generally lower the performance, however, mine is lowered by almost 45%. Even when I am sharing all weights. I am still trying out different configurations of the SVT, but I just want to make sure that I am not missing anything important to generally make it work.
I am training with fedprox and my learner is based on the CIFAR10 learner provided in the examples. For SVT parameters, I am primarily playing around with espilon and the noise_var values for now, keeping fraction at 1.0.
Is there anything wrong with my current approach or do I just have to be patient to find a good SVT configuration?
This is my current config client file:
Beta Was this translation helpful? Give feedback.
All reactions