Algorithms

Rollouts too short → accurate opponent models not fully utilized → low sample efficiency.
Rollouts too long → errors from inaccurate opponent models compound and push the rollouts far from the real trajectory distribution → degraded performance in the environment and low sample efficiency.
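The trade-off above suggests giving each opponent model its own rollout length. The following is a minimal sketch of one such heuristic, not the method implemented in this repository: the function name `rollout_lengths`, the `max_length` and `min_length` parameters, and the error values are all hypothetical. It assumes a per-opponent validation error is available and shortens rollouts in proportion to how much worse a model is than the best one.

```python
def rollout_lengths(model_errors, max_length=10, min_length=1):
    """Assign a rollout length to each opponent model (hypothetical heuristic).

    model_errors maps an opponent name to a non-negative validation error.
    The most accurate model gets max_length; less accurate models get
    proportionally shorter rollouts, never shorter than min_length.
    """
    smallest = min(model_errors.values())
    lengths = {}
    for name, err in model_errors.items():
        if err <= 0:
            # A model with no measured error is trusted for the full horizon.
            lengths[name] = max_length
        else:
            ratio = smallest / err  # 1 for the best model, smaller otherwise
            lengths[name] = max(min_length, int(max_length * ratio))
    return lengths


# Illustrative error values, not measured results.
errors = {"opponent_a": 0.05, "opponent_b": 0.20, "opponent_c": 0.50}
lengths = rollout_lengths(errors)
print(lengths)
```

With this rule, accurate models are used for long rollouts while inaccurate ones are cut off early, before their errors compound.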

Implementation Details

Language multi-agent

References:

CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society

Camel Multi-Agent Role-Playing Framework

Disaster resource allocation game

UI based on Crafter: Open world survival game for evaluating a wide range of agent abilities within a single environment.

Overview

Research challenges:
Meaningful evaluation:

Play Yourself

python3 -m pip install crafter  # Install Crafter
python3 -m pip install pygame   # Needed for human interface
python3 -m crafter.run_gui      # Start the game

Interface

To install Crafter, refer to the description in their repo.

Evaluation

Agents are allowed a budget of 1M environment steps and are evaluated by their success rates on the 22 achievements and by their geometric mean score. Example scripts for computing these are included in the analysis directory of the repository.

Reward: The sparse reward is +1 for unlocking an achievement during the episode and -0.1 or +0.1 for lost or regenerated health points. Results should be reported not as reward but as success rates and score.
Success rates: The success rate of each of the 22 achievements is computed as the percentage of all training episodes in which the achievement was unlocked, allowing insights into the ability spectrum of an agent.
Crafter score: The score is the geometric mean of success rates, so that improvements on difficult achievements contribute more than improvements on achievements with already high success rates.
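The two metrics above can be sketched in a few lines of Python. This is an illustrative example rather than the repository's analysis script: the function names and the episode data are hypothetical. The score uses the offset form exp(mean(ln(1 + s_i))) - 1 with s_i in percent, which is assumed here from the Crafter paper's definition and keeps a single zero success rate from collapsing the score to zero.

```python
import math


def success_rates(episodes, achievements):
    """Percentage of episodes in which each achievement was unlocked.

    episodes is a list of sets, one per training episode, holding the
    names of the achievements unlocked during that episode.
    """
    total = len(episodes)
    return {
        name: 100.0 * sum(name in unlocked for unlocked in episodes) / total
        for name in achievements
    }


def crafter_score(rates):
    """Geometric mean of success rates (in percent) with an offset of 1."""
    logs = [math.log(1.0 + rate) for rate in rates.values()]
    return math.exp(sum(logs) / len(logs)) - 1.0


# Made-up episodes over three of the achievements, for illustration only.
achievements = ["collect_wood", "place_table", "collect_diamond"]
episodes = [
    {"collect_wood", "place_table"},
    {"collect_wood"},
    {"collect_wood", "place_table"},
    set(),
]
rates = success_rates(episodes, achievements)
score = crafter_score(rates)
print(rates, score)
```

Because the mean is taken in log space, raising a rare achievement from 0% to 5% moves the score more than raising a common one from 75% to 80%.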

Scoreboards

Please create a pull request if you would like to add your own or another algorithm to the scoreboards. For the reinforcement learning and unsupervised agents categories, the interaction budget is 1M environment steps. The external knowledge category is defined more broadly.



Hither1/disaster-resource-allocation-game
