OpenAI Gym on Pendulum-v1

Evaluation Results

Performance of each model on this benchmark

Model	Action Repetition	Average Decisions	Mean Reward	Paper Title	Repository
TLA with Hierarchical Reward Functions	0.8073	38.6	-125.02	Creating Hierarchical Dispositions of Needs in an Agent
TLA	0.7032	62.31	-154.92	Optimizing Attention and Cognitive Control Costs Using Temporally-Layered Architectures
