HyperAI

AVA Action Recognition Dataset

Date

3 years ago

Size

52.82 MB

Organization

University of California Berkeley

License

CC BY 4.0

特色图像

AVA, short for Atomic Visual Actions, is a video dataset with audio-visual annotations designed to train robots to understand human activities. Each video clip is annotated in detail by annotators, and these annotations reflect the diverse scenes, recording conditions, and expressions of human activities.

The dataset annotations include:

  • Kinetics (AVA-Kinetics): It is a cross between AVA and Kinetics. In order to provide localized action labels on a wider range of visual scenes, the authors provide AVA action labels on Kinetics-700 videos, almost doubling the total number of annotations and increasing the number of videos of certain specific categories by more than 500 times.
  • Actions (AvA-Actions): The AVA dataset densely annotates 80 atomic visual actions in 430 15-minute movie clips. These actions are located in space and time, generating 1.62 million action labels, a large number of which are frequently used.
  • Spoken Activity (AVA ActiveSpeaker, AVA Speech): AVA ActiveSpeaker links sounds and visible faces in AVA v1.0 videos, annotating 3.65 million frames on ~39,000 faces. AVA Speech densely annotates speech activities in AVA v1.0 videos and explicitly annotates 3 background noise conditions, producing ~4,600 annotated clips over 45 hours.
AVA.torrent
Seeding 2Downloading 1Completed 496Total Downloads 525
  • AVA/
    • README.md
      1.9 KB
    • README.txt
      3.79 KB
      • data/
          • AVA Actions (v2.2)/
            • ava_included_timestamps_v2.2.txt
              8.17 KB
            • ava_test_excluded_timestamps_v2.2.csv
              9.27 KB
            • ava_train_excluded_timestamps_v2.2.csv
              11.94 KB
            • ava_train_v2.2.csv.zip
              5.44 MB
            • ava_v2.2.zip
              12.81 MB
            • ava_val_excluded_timestamps_v2.2.csv
              12.81 MB
            • ava_val_v2.2.csv.zip
              14.34 MB
          • AVA Active Speaker (v1.0)/
            • ava_activespeaker_train_v1.0.tar.bz2
              31.69 MB
            • ava_activespeaker_val_v1.0.tar.bz2
              36.55 MB
          • AVA Speech (v1.0)/
            • ava_speech_labels_v1.csv
              38.11 MB
          • AVA-Kinetics (v1.0)/
            • ava_kinetics_v1_0.tar.gz
              52.82 MB