openai/multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Top Related Projects

  • open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
  • ml-agents - The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
  • PettingZoo - An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities.
  • Gymnasium - An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym).
  • Football - Google Research Football, a multi-agent football/soccer simulation environment.
  • SMAC - The StarCraft Multi-Agent Challenge.

Quick Overview

The openai/multi-agent-emergence-environments repository is a collection of environments for studying the emergence of complex behaviors in multi-agent reinforcement learning. It provides a set of challenging scenarios where agents must learn to cooperate or compete to achieve their goals, enabling researchers to explore emergent behaviors and strategies in multi-agent systems.

Pros

  • Offers a diverse range of multi-agent environments for research
  • Facilitates the study of emergent behaviors in complex systems
  • Provides a standardized platform for comparing different multi-agent learning algorithms
  • Supports integration with popular reinforcement learning frameworks

Cons

  • Limited documentation and examples for some environments
  • May require significant computational resources for training complex multi-agent systems
  • Some environments may be too specialized for general-purpose research
  • Potential learning curve for researchers new to multi-agent reinforcement learning

Code Examples

# Example 1: Creating the Hide and Seek environment and stepping it with random actions
from mae_envs.envs.hide_and_seek import make_env

env = make_env()  # default arguments assumed to be usable as-is
obs = env.reset()

for _ in range(100):
    action = env.action_space.sample()  # random actions for all agents at once
    obs, reward, done, info = env.step(action)
    if done:
        obs = env.reset()

This sketch builds the Hide and Seek environment from mae_envs/envs/hide_and_seek.py with its default arguments, resets it, and runs 100 steps with randomly sampled actions, resetting whenever an episode ends.

# Example 2: Building the Box Locking environment
from mae_envs.envs.box_locking import make_env

env = make_env()  # defaults assumed; the task is configured via make_env arguments
obs = env.reset()

for _ in range(100):
    action = env.action_space.sample()
    obs, reward, done, info = env.step(action)

This sketch builds the Box Locking environment from mae_envs/envs/box_locking.py, which covers the Lock and Return and Sequential Lock transfer tasks; as with Hide and Seek, the specific task is selected through the arguments passed to make_env.

# Example 3: Inspecting observations and the action space
from mae_envs.envs.hide_and_seek import make_env

env = make_env()
obs = env.reset()
print(f"Observation keys: {list(obs.keys())}")
print(f"Action space: {env.action_space}")

Observations are returned as a dictionary of arrays and the action space is likewise dictionary-structured, covering all agents at once; the number of agents and other scenario settings are controlled through the make_env arguments.

Getting Started

To get started with multi-agent-emergence-environments:

  1. Clone this repository together with mujoco-worldgen, which it depends on:

    git clone https://github.com/openai/multi-agent-emergence-environments.git
    git clone https://github.com/openai/mujoco-worldgen.git

  2. Install mujoco-worldgen and then this repository:

    pip install -r mujoco-worldgen/requirements.txt
    pip install -e mujoco-worldgen/
    pip install -e multi-agent-emergence-environments/

  3. Run a simple example (a minimal sketch using default arguments):

    from mae_envs.envs.hide_and_seek import make_env

    env = make_env()
    obs = env.reset()

    for _ in range(100):
        action = env.action_space.sample()
        obs, reward, done, info = env.step(action)

This creates the Hide and Seek environment with its default settings and runs 100 steps with random actions. To visualize an environment, use the bundled script instead, e.g. bin/examine.py base from the repository root.

Competitor Comparisons

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Pros of open_spiel

  • Broader scope: Covers a wide range of games and algorithms
  • More extensive documentation and examples
  • Active development and community support

Cons of open_spiel

  • Higher complexity: Steeper learning curve for beginners
  • Less focus on emergent behaviors in multi-agent systems
  • Potentially slower execution due to its comprehensive nature

Code Comparison

multi-agent-emergence-environments:

# Sketch using this repo's make_env pattern (default arguments assumed)
from mae_envs.envs.hide_and_seek import make_env

env = make_env()
obs = env.reset()
action = env.action_space.sample()
next_obs, reward, done, info = env.step(action)

open_spiel:

import numpy as np
import pyspiel

game = pyspiel.load_game("tic_tac_toe")
state = game.new_initial_state()
legal_actions = state.legal_actions()
action = np.random.choice(legal_actions)
state.apply_action(action)

Both repositories provide environments for multi-agent reinforcement learning, but open_spiel offers a more comprehensive set of games and algorithms. multi-agent-emergence-environments focuses on emergent behaviors in specific scenarios, while open_spiel provides a broader framework for game-theoretic research. The code examples illustrate the different approaches: multi-agent-emergence-environments uses a gym-like interface, while open_spiel employs a more game-specific API.

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Pros of ml-agents

  • Integrated with Unity game engine, providing a robust and flexible environment for developing AI agents
  • Extensive documentation and tutorials, making it more accessible for beginners
  • Supports a wide range of learning algorithms and environments

Cons of ml-agents

  • Requires knowledge of Unity and C#, which may have a steeper learning curve for some users
  • Less focused on multi-agent scenarios compared to multi-agent-emergence-environments

Code Comparison

ml-agents:

using Unity.MLAgents;

public class RollerAgent : Agent
{
    public override void OnEpisodeBegin()
    {
        // Reset the agent's position and state at the start of each episode
    }
}

multi-agent-emergence-environments:

# Episode-begin logic in this repo lives in env modules or gym wrappers rather than
# per-agent classes; a hypothetical wrapper (not in the codebase) overriding reset:
import gym

class EpisodeBeginWrapper(gym.Wrapper):
    def reset(self):
        obs = self.env.reset()  # world and agent state are rebuilt here
        return obs

Both repositories provide frameworks for developing AI agents, but they differ in their focus and implementation. ml-agents is tightly integrated with Unity, offering a more comprehensive toolkit for game developers and researchers. multi-agent-emergence-environments, on the other hand, is more specialized in studying emergent behaviors in multi-agent systems, using simpler environments built with Python.

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Pros of PettingZoo

  • Broader scope: Supports a wide range of multi-agent environments beyond emergent behavior scenarios
  • Active development: Regularly updated with new features and environments
  • Extensive documentation: Comprehensive guides and examples for users

Cons of PettingZoo

  • Less focused on emergent behavior: May not provide specialized tools for studying emergence
  • Steeper learning curve: Due to its broader scope, it might be more complex for beginners

Code Comparison

PettingZoo:

from pettingzoo.butterfly import knights_archers_zombies_v10

env = knights_archers_zombies_v10.env()
env.reset()
for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()
    action = None if termination or truncation else env.action_space(agent).sample()
    env.step(action)

Multi-agent-emergence-environments:

# Environments here are assembled from mujoco-worldgen primitives plus modules and
# wrappers; the resulting env is used through a Gym-style interface (sketch):
from mae_envs.envs.hide_and_seek import make_env

env = make_env()
obs = env.reset()
action = env.action_space.sample()  # dict-style action covering all agents
obs, reward, done, info = env.step(action)

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Pros of Gymnasium

  • Broader scope, supporting a wide range of reinforcement learning environments
  • More active development and larger community support
  • Extensive documentation and examples for easier implementation

Cons of Gymnasium

  • Less focused on multi-agent scenarios
  • May require additional setup for complex multi-agent environments
  • Potentially steeper learning curve for beginners in multi-agent RL

Code Comparison

Multi-agent-emergence-environments:

# Sketch of this repo's make_env pattern (defaults assumed)
from mae_envs.envs.hide_and_seek import make_env

env = make_env()
obs = env.reset()
action = env.action_space.sample()
next_obs, reward, done, info = env.step(action)

Gymnasium:

import gymnasium

env = gymnasium.make('CartPole-v1')
observation, info = env.reset(seed=42)
action = env.action_space.sample()
observation, reward, terminated, truncated, info = env.step(action)

While both use similar API structures, Multi-agent-emergence-environments is tailored for multi-agent scenarios, whereas Gymnasium provides a more general-purpose framework for reinforcement learning environments.

Football: Google Research Football, a multi-agent football/soccer simulation environment.

Pros of Football

  • More realistic simulation of a complex team sport
  • Supports both single-agent and multi-agent scenarios
  • Includes a comprehensive set of pre-defined scenarios and reward functions

Cons of Football

  • Limited to a single sport, less versatile for general multi-agent research
  • Higher computational requirements due to more complex environment
  • Steeper learning curve for researchers unfamiliar with soccer/football rules

Code Comparison

Football:

import gfootball.env as football_env

env = football_env.create_environment(
    env_name="11_vs_11_stochastic",
    representation="raw",
    rewards="scoring,checkpoints"
)

Multi-agent-emergence-environments:

from mae_envs.envs.hide_and_seek import make_env

env = make_env()  # e.g. the Hide and Seek environment
env.reset()

Key Differences

Multi-agent-emergence-environments focuses on simpler, more abstract environments that allow for emergent behaviors, while Football provides a more structured and realistic simulation of a specific sport. The former is better suited for studying fundamental multi-agent interactions, while the latter is ideal for research on complex team strategies and decision-making in a well-defined context.

SMAC: The StarCraft Multi-Agent Challenge

Pros of SMAC

  • More focused on multi-agent reinforcement learning in StarCraft II
  • Provides a standardized benchmark for MARL research
  • Includes pre-built scenarios and evaluation metrics

Cons of SMAC

  • Limited to StarCraft II environment
  • Less flexibility in creating custom scenarios
  • May require more domain-specific knowledge to use effectively

Code Comparison

SMAC environment setup:

from smac.env import StarCraft2Env

env = StarCraft2Env(map_name="8m")
env.reset()

Multi-Agent Emergence Environments setup:

from mae_envs.envs.hide_and_seek import make_env

env = make_env()
env.reset()

Key Differences

  • SMAC is specifically designed for StarCraft II, while Multi-Agent Emergence Environments offer a broader range of scenarios
  • Multi-Agent Emergence Environments focus on emergent behaviors, while SMAC emphasizes tactical coordination
  • SMAC provides more structured scenarios, whereas Multi-Agent Emergence Environments allow for more open-ended interactions

Both repositories offer valuable tools for multi-agent research, with SMAC being more specialized and Multi-Agent Emergence Environments providing greater flexibility across different domains.

README

Status: Archive (code is provided as-is, no updates expected)

Multiagent emergence environments

Environment generation code for Emergent Tool Use From Multi-Agent Autocurricula (blog)

Installation

This repository depends on the mujoco-worldgen package. You will need to clone the mujoco-worldgen repository and install it and its dependencies:

git clone https://github.com/openai/mujoco-worldgen.git
pip install -r mujoco-worldgen/requirements.txt
pip install -e mujoco-worldgen/
pip install -e multi-agent-emergence-environments/

This repository has been tested only on Mac OS X and Ubuntu 16.04 with Python 3.6.

Use

Environment construction works in the following way: you start from the Base environment (defined in mae_envs/envs/base.py), add environment modules (e.g. Boxes, Ramps, RandomWalls), and then add wrappers on top. You can see examples in the mae_envs/envs folder.
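
For illustration, a construction in this style might look like the sketch below. The module and class names follow the README and the example environments, but the exact import paths, constructor arguments, and the add_module call are assumptions to verify against mae_envs/envs/base.py.

# Sketch of the Base + modules + wrappers paradigm (names and signatures are
# assumptions based on the example environments; verify against base.py)
from mae_envs.envs.base import Base
from mae_envs.modules.agents import Agents
from mae_envs.modules.walls import RandomWalls
from mae_envs.modules.objects import Boxes, Ramps

env = Base(n_agents=2)                  # start from the Base environment
env.add_module(RandomWalls(grid_size=30, num_rooms=4,
                           min_room_size=6, door_size=2))
env.add_module(Boxes(n_boxes=2))        # movable boxes
env.add_module(Ramps(n_ramps=1))        # ramps
env.add_module(Agents(n_agents=2))      # place the agents in the world
env.reset()
# Wrappers (rewards, extra observations, game mechanics) are then applied on top.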

If you want to construct a new environment, we highly recommend using the above paradigm in order to minimize code duplication. If you need new objects or game dynamics that don't already exist in this codebase, add them in via a new EnvModule class or a gym.Wrapper class rather than subclassing Base (or mujoco-worldgen's Env class). In general, EnvModules should be used for adding objects or sites to the environment, or otherwise modifying the mujoco simulator; wrappers should be used for everything else (e.g. adding rewards, additional observations, or implementing game mechanics like Lock and Grab).
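
For example, a small reward-shaping wrapper in this style could look like the sketch below; the class is hypothetical and not part of the codebase, it only illustrates where this kind of logic belongs.

import gym
import numpy as np

class SurvivalBonusWrapper(gym.Wrapper):
    """Hypothetical example: add a small per-step bonus to every agent's reward."""
    def __init__(self, env, bonus=0.01):
        super().__init__(env)
        self.bonus = bonus

    def step(self, action):
        obs, rew, done, info = self.env.step(action)
        rew = np.asarray(rew) + self.bonus  # rewards are per-agent arrays
        return obs, rew, done, info

A wrapper like this is applied last, after the modules, e.g. env = SurvivalBonusWrapper(make_env()).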

The environments defined in this repository are:

  • Hide and Seek - mae_envs/envs/hide_and_seek.py - the Hide and Seek environment described in the paper. This encompasses the random rooms, quadrant and food versions of the game (you can switch between them by changing the arguments given to the make_env function in the file; see also the sketch after this list).
  • Box Locking - mae_envs/envs/box_locking.py - encompasses the Lock and Return and Sequential Lock transfer tasks described in the paper.
  • Blueprint Construction - mae_envs/envs/blueprint_construction.py
  • Shelter Construction - mae_envs/envs/shelter_construction.py
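
Each of these files follows the paradigm described above; a minimal sketch, assuming each one exposes a make_env function with usable defaults (as hide_and_seek.py does):

from mae_envs.envs.blueprint_construction import make_env as make_blueprint_env
from mae_envs.envs.shelter_construction import make_env as make_shelter_env

blueprint_env = make_blueprint_env()   # Blueprint Construction with default settings
shelter_env = make_shelter_env()       # Shelter Construction with default settings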

You can test out environments by using the bin/examine script. Example usage: bin/examine.py base.
You can also use bin/examine to play a saved policy on an environment. There are several environment jsonnets and policies in the examples folder. Example usage:

bin/examine.py examples/hide_and_seek_quadrant.jsonnet examples/hide_and_seek_quadrant.npz

Note that to be able to play saved policies, you will need to install a few additional packages. You can do this via

pip install -r multi-agent-emergence-environments/requirements_ma_policy.txt