Welcome to FL-DLT3, created for experimental research in an article published on IEEE Internet of Things Journal.
Here is a structure of FL-DLT3:
Illustration of the proposed FL-DLT3 framework, where a policy network, a policy target network, two critic networks, and two critic target networks are trained. Each of the networks consists of a feedforward branch and a recurrent branch. A training round of FL is performed by the selected IoT devices with the allocated transmit power.Then, the edge server gets the reward
J. Zheng, K. Li, N. Mhaisen, W. Ni, E. Tovar and M. Guizani, "Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT," in IEEE Internet of Things Journal, doi: 10.1109/JIOT.2022.3176739. The more details can be found here
$ CUDA_VISIBLE_DEVICES=0,1 python jingjing_td3_lstm_v8.py --train
@article{zheng2022exploring,
title={Exploring Deep-Reinforcement-Learning-Assisted federated learning for Online Resource Allocation in Privacy-Preserving EdgeIoT},
author={Zheng, Jingjing and Li, Kai and Mhaisen, Naram and Ni, Wei and Tovar, Eduardo and Guizani, Mohsen},
journal={IEEE Internet of Things Journal},
volume={9},
number={21},
pages={21099--21110},
year={2022},
publisher={IEEE}
}