HyperAI

D4Rl On D4Rl

المقاييس

Average Reward

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجAverage Reward
rethinking-attention-with-performers63.8
reformer-the-efficient-transformer-163.9
cosformer-rethinking-softmax-in-attention-167.8
model-based-offline-reinforcement-learning88.2
transformers-are-rnns-fast-autoregressive64.4
koopman-q-learning-offline-reinforcement-181.8
flowformer-linearizing-transformers-with73.5
primal-attention-self-attention-through77.5
decision-transformer-reinforcement-learning72.2