Openai Gym On Lunarlander V2

Average Return

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	Average Return	Paper Title	Repository
AWR	229	Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Oblique decision tree	272.14	Evolutionary learning of interpretable decision trees

0 of 2 row(s) selected.