Search - Arena Documentation

Arena Documentation

Getting started

Getting started
Quickstart
Classic vs Advanced Training
Credits and plans

Tutorials

Tutorials
Run your first Classic RL experiment
Create an Advanced Training project
Upload a custom environment
Upload a reasoning dataset
Upload an object detection dataset
Train an LLM
View results and checkpoints
Deploy and invoke an agent
Billing and credits

Account & platform

Account
Profile and CLI keys
Account deletion
Organizations
Billing
Usage and statements

Projects & experiments

Projects
Experiments
Train, halt, and resume
Logs and metrics

Experiment wizard

Experiment wizard overview
Resources
Environment
Agent
Training
HPO

Environments

Environments
Custom environments
Validation and profiling

Datasets

Datasets
Reasoning datasets
Preference datasets
SFT datasets
Tabular datasets
Non-tabular datasets

Training

Training
How evolutionary hyperparameter optimization works
Classic RL algorithms
LLM algorithms
Supervised training
Training settings

Results & pipelines

Results
Checkpoints
Pipelines

Agents & inference

Agents
Create and deploy an agent
Invoke an agent
Inference contract

On-prem & compute

On-prem compute
Install a cluster
On-prem resource classes
Training cluster page

Arena CLI

Arena CLI
CLI authentication
CLI commands
On-prem CLI

Reference

Glossary
Algorithms reference
Experiment statuses
Plans and permissions
Finding your way in Arena

Troubleshooting

Troubleshooting

Copyright © 2026, AgileRL