HyperAI

Offline RL on D4RL

Metrics

Average Reward

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                    Average Reward
decision-transformer-reinforcement-learning   73.5
koopman-q-learning-offline-reinforcement-1    81.8
any-step-dynamics-model-improves-future       81