Farama-Foundation / PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

2,546 stars · 405 forks

Top Related Projects

  • Gym (34,461 stars): A toolkit for developing and comparing reinforcement learning algorithms.
  • ml-agents (16,887 stars): The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
  • OpenSpiel: a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
  • Football (Google Research Football); its repository description reads "Check out the new game server:"
  • multi-agent-emergence-environments: environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
  • FinRL (9,620 stars): Financial Reinforcement Learning. 🔥

Quick Overview

PettingZoo is a Python library for conducting research in multi-agent reinforcement learning. It provides a wide variety of environments with a simple interface, similar to OpenAI's Gym library but extended for multi-agent scenarios. PettingZoo supports various types of environments, including classic games, Atari games, and more complex simulations.

Pros

  • Extensive collection of multi-agent environments
  • Consistent API across different environment types
  • Compatibility with popular reinforcement learning libraries
  • Well-documented with examples and tutorials

Cons

  • Learning curve for users new to multi-agent reinforcement learning
  • Some environments may require additional dependencies
  • Performance can vary depending on the complexity of the environment
  • Limited support for custom environment creation

Code Examples

  1. Importing and initializing an environment:
from pettingzoo.classic import tictactoe_v3

env = tictactoe_v3.env()
env.reset()
  2. Stepping through an environment:
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    if termination or truncation:
        action = None
    else:
        action = env.action_space(agent).sample()  # This is where you would insert your agent's policy
    env.step(action)
  3. Rendering an environment:
env = tictactoe_v3.env(render_mode="human")
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()
    env.step(action)
    env.render()
env.close()

Getting Started

To get started with PettingZoo, follow these steps:

  1. Install PettingZoo:
pip install pettingzoo
  2. Import and use an environment:
from pettingzoo.classic import chess_v5

env = chess_v5.env()
env.reset()

for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    if termination or truncation:
        action = None
    else:
        action = env.action_space(agent).sample()
    env.step(action)

This example sets up a chess environment and runs a random agent. Replace the action = env.action_space(agent).sample() line with your own agent's policy to implement custom behavior.
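
As a minimal sketch of that substitution (my_policy is a hypothetical placeholder, and the action_mask lookup assumes the dict observations used by PettingZoo's classic environments):

import numpy as np

def my_policy(observation, agent):
    # Hypothetical placeholder policy: choose a random legal move using the
    # action mask exposed by the classic environments' observations.
    legal_actions = np.flatnonzero(observation["action_mask"])
    return int(np.random.choice(legal_actions))

env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else my_policy(observation, agent)
    env.step(action)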

Competitor Comparisons

Gym (34,461 stars): A toolkit for developing and comparing reinforcement learning algorithms.

Pros of Gym

  • Widely adopted and well-established in the reinforcement learning community
  • Extensive documentation and tutorials available
  • Simpler API for single-agent environments

Cons of Gym

  • Limited support for multi-agent environments
  • Less active development and maintenance compared to PettingZoo
  • Fewer built-in environments, especially for complex scenarios

Code Comparison

Gym:

import gym
env = gym.make("CartPole-v1")
observation, info = env.reset(seed=42)
for _ in range(1000):
    action = env.action_space.sample()
    observation, reward, terminated, truncated, info = env.step(action)

PettingZoo:

from pettingzoo.butterfly import pistonball_v6
env = pistonball_v6.env()
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()
    env.step(action)

The main difference is that PettingZoo uses an agent iteration loop, allowing for multi-agent environments, while Gym focuses on single-agent scenarios with a simpler API. PettingZoo offers more flexibility for complex, multi-agent reinforcement learning tasks, but may require more setup for basic environments.

ml-agents (16,887 stars): The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Pros of ml-agents

  • Seamless integration with Unity game engine, allowing for complex 3D environments
  • Comprehensive toolkit with pre-built components for reinforcement learning
  • Strong support for multi-agent scenarios and imitation learning

Cons of ml-agents

  • Limited to Unity environment, less flexible for non-Unity projects
  • Steeper learning curve for those unfamiliar with Unity
  • Potentially higher computational requirements for complex 3D simulations

Code Comparison

ml-agents:

from mlagents_envs.environment import UnityEnvironment
from mlagents_envs.side_channel.engine_configuration_channel import EngineConfigurationChannel

channel = EngineConfigurationChannel()
env = UnityEnvironment(file_name="MyEnvironment", side_channels=[channel])

PettingZoo:

from pettingzoo.butterfly import pistonball_v6

env = pistonball_v6.env()
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()

The code snippets demonstrate the initialization process for each library. ml-agents focuses on setting up a Unity environment, while PettingZoo offers a more straightforward approach to creating and interacting with multi-agent environments across various domains.

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Pros of OpenSpiel

  • Extensive library of game implementations, including classical board games and poker variants
  • Strong focus on game theory and multi-agent reinforcement learning
  • Comprehensive documentation and research-oriented design

Cons of OpenSpiel

  • Steeper learning curve for beginners due to its research-oriented nature
  • Less emphasis on modern video game environments compared to PettingZoo
  • Primarily C++ based, which may be less accessible for Python-focused developers

Code Comparison

OpenSpiel (C++):

#include "open_spiel/spiel.h"
#include "open_spiel/games/tic_tac_toe.h"

std::shared_ptr<const open_spiel::Game> game = open_spiel::LoadGame("tic_tac_toe");
std::unique_ptr<open_spiel::State> state = game->NewInitialState();

PettingZoo (Python):

from pettingzoo.classic import tictactoe_v3

env = tictactoe_v3.env()
env.reset()

Both libraries offer easy-to-use interfaces for creating game environments, but OpenSpiel's C++ implementation may require more setup compared to PettingZoo's Python-based approach.

Google Research Football (repository description: "Check out the new game server:")

Pros of Football

  • Focused on a specific domain (soccer/football), providing a deep and realistic simulation
  • Includes a built-in game engine with 3D rendering capabilities
  • Offers pre-trained models and baselines for benchmarking

Cons of Football

  • Limited to a single game environment, reducing versatility for general reinforcement learning research
  • Steeper learning curve due to the complexity of the soccer simulation
  • Requires more computational resources for 3D rendering and physics simulation

Code Comparison

PettingZoo example:

from pettingzoo.butterfly import knights_archers_zombies_v10
env = knights_archers_zombies_v10.env()
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()
    env.step(action)

Football example:

from gfootball.env import create_environment
env = create_environment(env_name="academy_empty_goal_close", stacked=False)
obs = env.reset()
action = env.action_space.sample()
obs, reward, done, info = env.step(action)

Both repositories provide reinforcement learning environments, but PettingZoo offers a wider range of multi-agent scenarios across various domains, while Football focuses exclusively on soccer simulations with higher fidelity and domain-specific features.

multi-agent-emergence-environments: environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Pros of multi-agent-emergence-environments

  • Focuses on emergent behavior and complex multi-agent interactions
  • Provides environments specifically designed for studying collective intelligence
  • Includes advanced scenarios like hide-and-seek and tool use

Cons of multi-agent-emergence-environments

  • Limited variety of environments compared to PettingZoo
  • Less active development and community support
  • Narrower scope, primarily focused on OpenAI's research interests

Code Comparison

PettingZoo environment initialization:

from pettingzoo.butterfly import knights_archers_zombies_v10
env = knights_archers_zombies_v10.env()
env.reset()

multi-agent-emergence-environments initialization:

# multi-agent-emergence-environments composes environments from mujoco-worldgen
# components plus wrappers; the packaged scenarios expose make_env helpers
# (assuming the hide_and_seek scenario here; other scenarios are built similarly)
from mae_envs.envs.hide_and_seek import make_env

env = make_env()

Both libraries provide multi-agent environments, but PettingZoo offers a wider range of scenarios and a more standardized API. multi-agent-emergence-environments excels in complex, emergent behavior studies but has a narrower focus and less extensive documentation.

FinRL (9,620 stars): Financial Reinforcement Learning. 🔥

Pros of FinRL

  • Specialized for financial reinforcement learning tasks
  • Includes pre-built environments for various financial markets
  • Provides comprehensive documentation and tutorials for financial RL applications

Cons of FinRL

  • More limited in scope compared to PettingZoo's diverse environments
  • Less active community and fewer contributors
  • Steeper learning curve for users not familiar with financial concepts

Code Comparison

FinRL example:

from finrl.apps import crypto_trading
from finrl.finrl_meta import data_processor as dp

df = dp.download_data(start_date = '2019-09-12', end_date = '2021-05-31', ticker_list = ['BTC/USDT'])
env = crypto_trading.CryptoEnv(df=df)

PettingZoo example:

from pettingzoo.butterfly import knights_archers_zombies_v10

env = knights_archers_zombies_v10.env()
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()
    env.step(action)

Both libraries offer easy-to-use APIs for creating and interacting with environments, but FinRL focuses on financial scenarios while PettingZoo provides a wider range of multi-agent environments across various domains.


README


PettingZoo is a Python library for conducting research in multi-agent reinforcement learning, akin to a multi-agent version of Gymnasium.

The documentation website is at pettingzoo.farama.org and we have a public discord server (which we also use to coordinate development work) that you can join here: https://discord.gg/nhvKkYa6qX

Environments

PettingZoo includes the following families of environments:

  • Atari: multi-player Atari 2600 games (cooperative, competitive, and mixed-sum)
  • Butterfly: cooperative graphical games requiring a high degree of coordination
  • Classic: classical board and card games such as chess, tic-tac-toe, and poker
  • MPE: simple multi-agent particle communication tasks
  • SISL: cooperative continuous-control environments

Installation

To install the base PettingZoo library: pip install pettingzoo.

This does not include dependencies for all families of environments (some environments can be problematic to install on certain systems).

To install the dependencies for one family, use pip install 'pettingzoo[atari]', or use pip install 'pettingzoo[all]' to install all dependencies.

We support Python 3.8, 3.9, 3.10 and 3.11 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

Note: Some Linux distributions may require manual installation of cmake, swig, or zlib1g-dev (e.g., sudo apt install cmake swig zlib1g-dev)

Getting started

For an introduction to PettingZoo, see Basic Usage. To create a new environment, see our Environment Creation Tutorial and Custom Environment Examples. For examples of training RL models with PettingZoo, see the tutorials on the documentation website.
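
As one illustration of the kind of pipeline those tutorials walk through, here is a hedged sketch that assumes SuperSuit and Stable-Baselines3 are installed; the versioned wrapper names and arguments may differ slightly between releases:

import supersuit as ss
from stable_baselines3 import PPO
from pettingzoo.butterfly import pistonball_v6

# Preprocess the parallel environment so every agent sees small stacked frames
env = pistonball_v6.parallel_env()
env = ss.resize_v1(env, x_size=84, y_size=84)
env = ss.frame_stack_v1(env, 4)

# Treat each agent as a separate vectorized environment for single-agent PPO
env = ss.pettingzoo_env_to_vec_env_v1(env)
env = ss.concat_vec_envs_v1(env, 8, num_cpus=1, base_class="stable_baselines3")

model = PPO("CnnPolicy", env, verbose=1)
model.learn(total_timesteps=100_000)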

API

PettingZoo models environments as Agent Environment Cycle (AEC) games so that it can cleanly support all types of multi-agent RL environments under one API and minimize the potential for certain classes of common bugs.

Using environments in PettingZoo is very similar to Gymnasium, i.e. you initialize an environment via:

from pettingzoo.butterfly import pistonball_v6
env = pistonball_v6.env()

Environments can be interacted with in a manner very similar to Gymnasium:

env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()  # this is where you would insert your policy
    env.step(action)

For the complete API documentation, please see https://pettingzoo.farama.org/api/aec/

Parallel API

In certain environments, it is valid to assume that agents take their actions at the same time. For these games, we offer a secondary API that allows parallel actions, documented at https://pettingzoo.farama.org/api/parallel/
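
A minimal sketch of the parallel loop (assuming pistonball_v6; in recent PettingZoo versions reset also returns infos, while older releases return only observations):

from pettingzoo.butterfly import pistonball_v6

parallel_env = pistonball_v6.parallel_env()
observations, infos = parallel_env.reset()
while parallel_env.agents:
    # every live agent submits its action in the same step
    actions = {agent: parallel_env.action_space(agent).sample() for agent in parallel_env.agents}
    observations, rewards, terminations, truncations, infos = parallel_env.step(actions)
parallel_env.close()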

SuperSuit

SuperSuit is a library that includes all commonly used wrappers in RL (frame stacking, observation normalization, etc.) for PettingZoo and Gymnasium environments with a nice API. We developed it in lieu of building wrappers into PettingZoo itself. https://github.com/Farama-Foundation/SuperSuit
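
For instance, a hedged sketch of wrapping a PettingZoo environment with SuperSuit (the wrapper names are versioned and may vary by release):

import supersuit as ss
from pettingzoo.butterfly import pistonball_v6

env = pistonball_v6.env()
env = ss.color_reduction_v0(env, mode="B")                    # grayscale the image observations
env = ss.frame_stack_v1(env, 4)                               # stack the last four frames
env = ss.clip_reward_v0(env, lower_bound=-1, upper_bound=1)   # keep rewards in a fixed range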

Environment Versioning

PettingZoo keeps strict versioning for reproducibility reasons. All environments end in a suffix like "_v0". When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion.

Project Maintainers

Project Manager: Elliot Tower

Maintenance for this project is also contributed by the broader Farama team: farama.org/team.

Citation

To cite this project in publication, please use

@article{terry2021pettingzoo,
  title={Pettingzoo: Gym for multi-agent reinforcement learning},
  author={Terry, J and Black, Benjamin and Grammel, Nathaniel and Jayakumar, Mario and Hari, Ananth and Sullivan, Ryan and Santos, Luis S and Dieffendahl, Clemens and Horsch, Caroline and Perez-Vicente, Rodrigo and others},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  pages={15032--15043},
  year={2021}
}