URLB Reinforcement Learning Dataset
Date
3 years ago
Publish URL
URLB stands for Unsupervised Reinforcement Learning Benchmark, which is an unsupervised reinforcement learning dataset. URLB consists of two stages: a pre-training stage without rewards and a downstream task adaptation stage with external rewards. Based on the DeepMind control suite, this dataset provides 12 continuous control tasks from three fields for evaluation.