Asynchronous Advantage Actor-Centralized-Critic with Communication (A3C3)

A distributed asynchronous actor-critic algorithm in a multi-agent setting with differentiable communication and a centralized critic.

Check out learned policies here: https://youtu.be/fB71yKcP3iU

Contains 4 environment suites:

POC Suite: Hidden Reward, Navigation, Pursuit, Traffic Intersection
MPE Suite: Cooperative Navigation, Cooperative Communication, Cooperative Reference, Tag
KiloBot Suite: Light, Join, Split
3d Soccer Simulation Suite: Passing, Keep-Away

Also contains scripts to launch A3C3 and learn policies. Use the requirements.txt to install your dependencies and run the scripts.

Each agent is defined by 3 networks.

The algorithm is distributed, and multiple workers update the networks.

The actor network learns a local policy.

The centralized critic evaluates the policy.

The communicator network learns a communication protocol between agents.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
BlindGroupUp		BlindGroupUp
FCPKeepAway		FCPKeepAway
FCPPassing		FCPPassing
GeoFriends2		GeoFriends2
KiloBots		KiloBots
KiloBotsJoin		KiloBotsJoin
KiloBotsJoinSwarm		KiloBotsJoinSwarm
KiloBotsSplit		KiloBotsSplit
KiloBotsSplitSwarm		KiloBotsSplitSwarm
KiloBotsSwarm		KiloBotsSwarm
Navigation		Navigation
Pursuit		Pursuit
SimpleAdv		SimpleAdv
SimpleReference		SimpleReference
SimpleSL		SimpleSL
SimpleSpread		SimpleSpread
SimpleSpread6		SimpleSpread6
SimpleTag		SimpleTag
Traffic		Traffic
simulator		simulator
simulator_fcp		simulator_fcp
simulator_geof2		simulator_geof2
simulator_kilobots		simulator_kilobots
simulator_openai		simulator_openai
.gitignore		.gitignore
BlindGroupUpACBatch.sh		BlindGroupUpACBatch.sh
BlindGroupUpBatch.sh		BlindGroupUpBatch.sh
BlindGroupUpCommsBatch.sh		BlindGroupUpCommsBatch.sh
FCPKeepAway.sh		FCPKeepAway.sh
FCPPass.sh		FCPPass.sh
Helper.py		Helper.py
KBBatchDist.sh		KBBatchDist.sh
KBJBatchDist.sh		KBJBatchDist.sh
KBSBatchDist.sh		KBSBatchDist.sh
NavACBatch.sh		NavACBatch.sh
NavBatch.sh		NavBatch.sh
NavCommsBatch.sh		NavCommsBatch.sh
NavParamBatch.sh		NavParamBatch.sh
PursuitACBatchDist.sh		PursuitACBatchDist.sh
PursuitBatchDist.sh		PursuitBatchDist.sh
PursuitCommsBatchDist.sh		PursuitCommsBatchDist.sh
README.md		README.md
SwarmKBBatchDist.sh		SwarmKBBatchDist.sh
SwarmKBJBatchDist.sh		SwarmKBJBatchDist.sh
SwarmKBSBatchDist.sh		SwarmKBSBatchDist.sh
TrafficBatch.sh		TrafficBatch.sh
TrafficCommsBatch.sh		TrafficCommsBatch.sh
__init__.py		__init__.py
requirements.txt		requirements.txt

Provide feedback