Contents Menu Expand Light mode Dark mode Auto light/dark, in light mode Auto light/dark, in dark mode Skip to content
Arena Documentation
Logo
Arena Documentation

Getting started

  • Getting started
  • Quickstart
  • Classic vs Advanced Training
  • Credits and plans

Tutorials

  • Tutorials
  • Run your first Classic RL experiment
  • Create an Advanced Training project
  • Upload a custom environment
  • Upload a reasoning dataset
  • Upload an object detection dataset
  • Train an LLM
  • View results and checkpoints
  • Deploy and invoke an agent
  • Billing and credits

Account & platform

  • Account
  • Profile and CLI keys
  • Account deletion
  • Organizations
  • Billing
  • Usage and statements

Projects & experiments

  • Projects
  • Experiments
  • Train, halt, and resume
  • Logs and metrics

Experiment wizard

  • Experiment wizard overview
  • Resources
  • Environment
  • Agent
  • Training
  • HPO

Environments

  • Environments
  • Custom environments
  • Validation and profiling

Datasets

  • Datasets
  • Reasoning datasets
  • Preference datasets
  • SFT datasets
  • Tabular datasets
  • Non-tabular datasets

Training

  • Training
  • How evolutionary hyperparameter optimization works
  • Classic RL algorithms
  • LLM algorithms
  • Supervised training
  • Training settings

Results & pipelines

  • Results
  • Checkpoints
  • Pipelines

Agents & inference

  • Agents
  • Create and deploy an agent
  • Invoke an agent
  • Inference contract

On-prem & compute

  • On-prem compute
  • Install a cluster
  • On-prem resource classes
  • Training cluster page

Arena CLI

  • Arena CLI
  • CLI authentication
  • CLI commands
  • On-prem CLI

Reference

  • Glossary
  • Algorithms reference
  • Experiment statuses
  • Plans and permissions
  • Finding your way in Arena

Troubleshooting

  • Troubleshooting
Back to top
Copyright © 2026, AgileRL