AgentWorld

AgentWorld is a collaborative gaming environment where AI agents can interact, explore, and play together in a 2D multiplayer world. Built as a research platform for multi-agent AI systems, AgentWorld enables developers to create, control, and study AI agents in a rich gaming environment.

Key features for AI research and development:

Multi-agent coordination: Agents can collaborate, compete, and communicate with each other
Rich observation space: Comprehensive API for agents to perceive their environment
Action space: Agents can move, chat, fight, craft, and interact with the world
Persistent world: Agents can learn and adapt over time in a persistent environment
Research-friendly: Built-in APIs for easy integration with AI frameworks

Credits

AgentWorld is built upon the excellent open-source foundation of Kaetram, specifically based on the Kaetram master branch. Kaetram is an open-source RPG game that expands on Little Workshop's BrowserQuest. We're grateful to the Kaetram community for creating such a robust and well-designed game engine that serves as the perfect foundation for multi-agent AI research.

Get Started

Prerequisites

You must first install Node.js to run the project, and optionally install MongoDB to store user data on the server.

NOTE: Node.js

We recommend using Node.js version v20 or higher, following the Long Term Support (LTS) schedule, to have the most stable experience when developing/experimenting with AgentWorld. Older versions may not work with our current dependencies and package manager.

NOTE: MongoDB

MongoDB is not a requirement for AgentWorld to run, but you can store and save user data if you install it and run an online environment with all the features enabled. To do this, see Configuration, and set SKIP_DATABASE=false. If you do choose to install MongoDB, a user is not necessary, but you can enable authentication with the MONGODB_AUTH setting.

Yarn

You will also need to enable Yarn through Corepack, to manage the dependencies.

The preferred way to manage Yarn is through Corepack, a new binary shipped with all Node.js releases [...]

To enable it, run the following command:
corepack enable
https://yarnpkg.com/getting-started/install

Installing

Install the dependencies by simply running

yarn

Configuration

Copy and rename .env.defaults to .env, and modify the contents to fit your needs.

cp .env.defaults .env

Enable Mongodb by setting SKIP_DATABASE=false in the .env file.

Running

To run live development builds, use

yarn dev

To create production builds, run

yarn build

Then, to run each production build, use

yarn start

We recommend to run first with yarn dev to start the server in development mode, so that you can see the changes you make to the code in real time.

Run Agent Console

AgentWorld comes with a CLI tool to deploy an agent and interact with it using natural language. Make sure the game server is running, then in another terminal run:

python agents/console.py \
--provider openai \
--model gpt-4o \
--username test1 \
--password test1

Advanced Agent Configuration

You can configure the agent's initial state using command line options:

# Set initial location and combat skills
python agents/console.py \
--provider openai \
--model gpt-4o \
--username test1 \
--password test1 \
--location 300,200 \
--combat-level-accuracy 45 \
--combat-level-strength 50 \
--combat-level-magic 30

# Start with equipped items and inventory
python agents/console.py \
--provider openai \
--model gpt-4o \
--username test1 \
--password test1 \
--equipped-items bastardsword whitearmor ironhelmet:1:3 \
--inventory-items healingpotion:10 firepotion:5 stick:20

# Full configuration example
python agents/console.py \
--provider openai \
--model gpt-4o \
--username test1 \
--password test1 \
--location 250,180 \
--combat-level-accuracy 60 \
--combat-level-strength 60 \
--combat-level-defense 50 \
--combat-level-health 70 \
--combat-level-magic 40 \
--combat-level-archery 35 \
--equipped-items bastardsword:1:2 whitearmor goldboots \
--inventory-items healingpotion:20 firepotion:10 pythararrow:100 \
--task "explore the world and fight strong enemies"

Initial State Options

--location x,y: Teleport to coordinates at startup (e.g., --location 250,180)
--combat-level-xxx N: Set specific combat skill levels (1-120)
- Available skills: accuracy, strength, defense, health, magic, archery
- HP and MP are automatically restored to maximum after setting combat levels
--equipped-items item1 item2:count:enchant: Set initial equipped items
--inventory-items item1:count item2:count:enchant: Set initial inventory items

Item Format

Basic: itemkey (e.g., coppersword)
With count: itemkey:count (e.g., healingpotion:10)
With enchantment: itemkey:count:enchant (e.g., ironhelmet:1:3)

This will launch a console where you can interact with the agent.

Cheat Commands

You can use cheat commands both interactively and with the --task option:

/teleport x y or /teleport spawn - Teleport to coordinates
/equip itemkey [count] [enchant] - Give and equip item
/setlevel level - Set all combat skills to level
/give itemkey [count] - Give items to inventory
/fullequip - Give essential equipment set
/observe [radius] - Show raw observation JSON data

Examples:

# Interactive mode
🎮 What would you like to do? > /observe 32

# Single-task mode
python agents/console.py --task "/observe 32" --username test1 --password test1
python agents/console.py --task "/teleport 300 200" --username test1 --password test1  
python agents/console.py --task "/setlevel 50" --username test1 --password test1

# Create fresh characters (no password needed)
python agents/console.py --new-character --username freshbot --task "/observe"
python agents/console.py --new-character --username testchar --location 300,200 --combat-level-strength 45

New Character Creation

Use --new-character to create fresh characters or reset existing ones:

No password required - The system handles authentication automatically
Fresh state - Characters start with default level 1 stats and empty inventory
Automatic reset - If username exists, it's reset to default state first
Works with all options - Combine with location, combat levels, equipment, etc.

The /observe command will display the complete environment observation data in JSON format.

Task Runner System

AgentWorld includes a comprehensive task runner system that allows you to execute structured tasks from YAML configuration files. This is ideal for automated testing, benchmarking, and running complex multi-step scenarios.

Prerequisites for Task Runner

Before using the task runner, ensure you have:

Game server running: Start AgentWorld with yarn dev
API keys configured: Set up your LLM provider API keys as environment variables

# For Qwen (Alibaba Cloud DashScope)
export DASHSCOPE_API_KEY=your-dashscope-api-key

# For OpenAI
export OPENAI_API_KEY=sk-your-openai-key

# For Anthropic Claude
export ANTHROPIC_API_KEY=sk-ant-your-claude-key

# For DeepSeek
export DEEPSEEK_API_KEY=sk-your-deepseek-key

Quick Start with Task Runner

Create a task configuration (see data_v0.1_solo/task_1.yaml for example)
Use an agent configuration (see agents/configs/qwen_agent.yaml)
Run the task:

# Run a single task
python3 agents/run.py --task data_v0.1_solo/task_1.yaml --agent agents/configs/qwen_agent.yaml --output logs/

# Run all tasks in a folder
python3 agents/run.py --task-folder data_v0.1_solo/ --agent agents/configs/qwen_agent.yaml --output logs/

Task Configuration Format

Create YAML files that define structured tasks for your agents:

# Task definition
task:
  name: "Smithing Task - Craft Heavy Sword"
  description: "craft a heavy sword using smithing skills"

# Task objectives
objectives:
  primary: "Craft a heavy sword using smithing skills"
  secondary:
    - "Gather any missing materials if needed"
    - "Use smithing station to craft the sword"
    - "Verify successful crafting completion"

# Agent configuration
agent_1:
  username: "smith_test"
  new_character: true
  
  location:
    x: 88
    y: 33
  
  skill_levels:
    smithing: 15
    mining: 10
  
  inventory_items:
    - item: "ironbar"
      count: 3
    - item: "hilt2"
      count: 1
  
  equipped_items:
    - item: "smithinghammer"
      count: 1
      enchant: 0

# Success criteria
success_criteria:
  - "Heavy sword is successfully crafted"

# Optional: Game context and limitations
max_action_steps: 100
relevant_game_context: |
  You are a smithing expert who knows how to use the forge and anvil effectively.

Agent Configuration

Agent configurations define the LLM provider settings and behavior:

# agents/configs/qwen_agent.yaml
agent:
  name: "QwenAgent"
  provider: "qwen"
  
  llm:
    model: "qwen-plus"
    api_key: null  # Uses DASHSCOPE_API_KEY environment variable
    base_url: "https://dashscope.aliyuncs.com/compatible-mode/v1"
    max_tokens: 4096
    temperature: 0.7
    
  game:
    host: "http://localhost:7031"
    timeout: 30
    max_retries: 3

  # Custom system prompt template (optional)
  system_prompt: |
    You are an intelligent AI agent that plays the AgentWorld MMORPG game...
    
    === CURRENT ENVIRONMENT OBSERVATION ===
    {{observation}}
    === END OBSERVATION ===

Key Features

📋 Structured Task Definitions: Define complex multi-step tasks in YAML
🤖 Multi-Agent Support: Run tasks with multiple agents (agent_1, agent_2, etc.)
🔧 Configurable Agents: Customize LLM provider, model, and system prompts
📊 Comprehensive Logging: Detailed logs saved to specified output folder
📈 Execution Metrics: Track duration, iterations, success rates
🎯 Batch Processing: Run entire folders of tasks automatically

Output and Logging

The task runner creates comprehensive logs in your specified output directory:

logs/
├── task_runner_20240115_143022.log     # Main runner log
├── agent_1_20240115_143022.json        # Individual agent execution log
├── task_summary_20240115_143022.json   # Task execution summary
└── folder_summary_20240115_143022.json # Batch execution summary (if running folder)

Advanced Usage

# Run with different agent configurations
python3 agents/run.py --task task.yaml --agent configs/openai_agent.yaml --output results/
python3 agents/run.py --task task.yaml --agent configs/claude_agent.yaml --output results/

# Run multiple tasks with comprehensive logging
python3 agents/run.py --task-folder experiments/ --agent configs/qwen_agent.yaml --output experiment_results/

Task Runner vs Console

Feature	Console (`console.py`)	Task Runner (`run.py`)
Use Case	Interactive development, testing	Automated execution, benchmarking
Input	Command line arguments	YAML configuration files
Output	Console + optional JSON log	Structured logs + summaries
Multi-agent	Single agent per session	Multiple agents per task
Reproducibility	Manual setup each time	Fully reproducible from configs
Batch Processing	No	Yes (entire folders)

Choose console.py for interactive development and testing, and run.py for automated task execution and research experiments.

Troubleshooting

Common Issues

API Key Errors (401 Unauthorized)

Error: API call failed: 401 Client Error: Unauthorized

Verify your API key is correctly set as an environment variable
For Qwen: Check your DashScope API key format and permissions
For other providers: Ensure the API key has sufficient credits/permissions

Game Server Connection Issues

Error: Game server not responding

Ensure AgentWorld is running with yarn dev
Check that the game server is accessible at http://localhost:7031
Verify no firewall is blocking the connection

Task Execution Failures

Check the detailed logs in the output directory
Review the task summary JSON for specific error messages
Ensure task configuration YAML syntax is valid

Debug Commands

# Test game server connectivity
curl http://localhost:7031/ai/observe

# Test API key configuration
echo $DASHSCOPE_API_KEY  # Should show your API key

# Run with verbose logging
python3 agents/run.py --task task.yaml --agent config.yaml --output logs/ 2>&1 | tee debug.log

License

AgentWorld inherits the MPL-2.0 license from Kaetram.

Name		Name	Last commit message	Last commit date
Latest commit History 3,049 Commits
.yarn/releases		.yarn/releases
agents		agents
data_v0.1_multi		data_v0.1_multi
data_v0.1_solo		data_v0.1_solo
docs		docs
packages		packages
scripts		scripts
.commitlintrc		.commitlintrc
.editorconfig		.editorconfig
.env.defaults		.env.defaults
.env.e2e		.env.e2e
.eslintrc		.eslintrc
.gitignore		.gitignore
.gitmodules		.gitmodules
.prettierignore		.prettierignore
.prettierrc		.prettierrc
.yarnrc.yml		.yarnrc.yml
JINJA2_TEMPLATES.md		JINJA2_TEMPLATES.md
LICENSE		LICENSE
README.md		README.md
analyze_tile_data.py		analyze_tile_data.py
package.json		package.json
tsconfig.json		tsconfig.json

License

openagents-org/agentworld

Folders and files

Latest commit

History

Repository files navigation

AgentWorld

Credits

Get Started

Prerequisites

NOTE: Node.js

NOTE: MongoDB

Yarn

Installing

Configuration

Running

Run Agent Console

Advanced Agent Configuration

Initial State Options

Item Format

Cheat Commands

New Character Creation

Task Runner System

Prerequisites for Task Runner

Quick Start with Task Runner

Task Configuration Format

Agent Configuration

Key Features

Output and Logging

Advanced Usage

Task Runner vs Console

Troubleshooting

Common Issues

Debug Commands

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 11

Uh oh!

Languages

Packages