Bug in MultiAgentPolicyManager - and Fix #967

FahmidMorshed · 2023-10-16T20:52:00Z

I have marked all applicable categories:
- exception-raising bug
- RL algorithm bug
- documentation request (i.e. "X is missing from the documentation.")
- new feature request
I have visited the source website
I have searched through the issue tracker for duplicates

I have mentioned version numbers, operating system and environment, where applicable:

import tianshou, gymnasium as gym, torch, numpy, sys
print(tianshou.__version__, gym.__version__, torch.__version__, numpy.__version__, sys.version, sys.platform)

I tried running MARL with multiple SAC policies as the internal per-agent policies on one of my custom PettingZoo environments. After extensive debugging, I realized that MultiAgentPolicyManager in version 0.5.1 needs a train(self, mode: bool = True) method that sets every internal policy to the given mode. Without it, the internal policies are never switched to eval mode. I am not sure whether this also happens with other types of internal policies.

Adding the following method to MultiAgentPolicyManager in tianshou/policy/multiagent/mapolicy.py solves the issue:

def train(self, mode: bool = True):
    """Set each internal policy to training mode, or to eval mode if mode is False."""
    # Forward the requested mode to every per-agent policy.
    for policy in self.policies.values():
        policy.train(mode)
    return self

I can open a pull request against the main branch if needed.
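The effect of the missing method can be sketched without Tianshou or PyTorch. The classes below (FakePolicy, ManagerWithoutFix, ManagerWithFix) are hypothetical stand-ins, not Tianshou code. They assume the manager keeps its per-agent policies in a plain dict, so that a mode switch on the manager does not reach them unless train() forwards it explicitly; that assumption is one plausible explanation of the behavior described above and is not confirmed in this thread.

```python
class FakePolicy:
    """Hypothetical stand-in for one agent's policy (e.g. a SAC policy)."""

    def __init__(self) -> None:
        self.training = True  # mirrors the usual train/eval flag

    def train(self, mode: bool = True) -> "FakePolicy":
        self.training = mode
        return self


class ManagerWithoutFix:
    """Hypothetical manager: flips only its own flag, never the policies'."""

    def __init__(self, policies: dict) -> None:
        self.policies = policies  # plain dict: agent_id -> policy (assumed)
        self.training = True

    def train(self, mode: bool = True) -> "ManagerWithoutFix":
        self.training = mode  # the inner policies are not touched here
        return self

    def eval(self) -> "ManagerWithoutFix":
        return self.train(False)


class ManagerWithFix(ManagerWithoutFix):
    """Same manager with the train() override proposed in this issue."""

    def train(self, mode: bool = True) -> "ManagerWithFix":
        for policy in self.policies.values():
            policy.train(mode)
        return self


def make_policies() -> dict:
    return {"agent_0": FakePolicy(), "agent_1": FakePolicy()}


broken = ManagerWithoutFix(make_policies())
broken.eval()
broken_flags = [p.training for p in broken.policies.values()]

fixed = ManagerWithFix(make_policies())
fixed.eval()
fixed_flags = [p.training for p in fixed.policies.values()]

print("without fix:", broken_flags)
print("with fix:   ", fixed_flags)
```

A check of this kind (call eval() on the manager, then inspect each internal policy's training flag) could also serve as a quick regression test against the real classes.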

Trinkle23897 · 2023-10-16T23:28:04Z

Oops sorry, and yeah good catch! Feel free to submit a PR.

FahmidMorshed · 2023-10-16T23:49:07Z

Added a PR to fix the issue. Thanks.

The trained MARL policies were not performing as expected because the parent class (MultiAgentPolicyManager) needed a train function. Fixes #967

FahmidMorshed mentioned this issue Oct 16, 2023

Fixed the mapolicy train issue #968

Merged

9 tasks

Trinkle23897 closed this as completed in #968 Oct 17, 2023

Trinkle23897 pushed a commit that referenced this issue Oct 17, 2023

Fixed the mapolicy train issue (#968)

bf78410

The trained MARL policies were not performing as expected because the parent class (MultiAgentPolicyManager) needed a train function. Fixes #967
