HyperAIHyperAI

Command Palette

Search for a command to run...

a month ago

OceanGym: A Benchmark Environment for Underwater Embodied Agents

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Abstract

We introduce OceanGym, the first comprehensive benchmark for ocean underwaterembodied agents, designed to advance AI in one of the most demanding real-worldenvironments. Unlike terrestrial or aerial domains, underwater settings presentextreme perceptual and decision-making challenges, including low visibility,dynamic ocean currents, making effective agent deployment exceptionallydifficult. OceanGym encompasses eight realistic task domains and a unifiedagent framework driven by Multi-modal Large Language Models (MLLMs), whichintegrates perception, memory, and sequential decision-making. Agents arerequired to comprehend optical and sonar data, autonomously explore complexenvironments, and accomplish long-horizon objectives under these harshconditions. Extensive experiments reveal substantial gaps betweenstate-of-the-art MLLM-driven agents and human experts, highlighting thepersistent difficulty of perception, planning, and adaptability in oceanunderwater environments. By providing a high-fidelity, rigorously designedplatform, OceanGym establishes a testbed for developing robust embodied AI andtransferring these capabilities to real-world autonomous ocean underwatervehicles, marking a decisive step toward intelligent agents capable ofoperating in one of Earth's last unexplored frontiers. The code and data areavailable at https://github.com/OceanGPT/OceanGym.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp