Date: a year ago

Size: 86.64 GB

Organization

Paper URL: arxiv.org

Dataset features:

Large scale: Chinese-LiPS has a total duration of about 100 hours and contains 36,208 high-quality speech clips recorded by 207 professional speakers, giving it good representativeness and diversity.
Wide topic coverage: The content spans 9 popular fields, including science and technology, health and wellness, culture and history, tourism and exploration, the automobile industry, and sports events. Topics are evenly distributed and reflect the speaking style and terminology density of real teaching and explanation scenarios.
High-quality slide production: Domain experts design the content and take part in annotation to ensure the accuracy and professionalism of the slide text and images. The slides are clearly structured and well designed, with rich images and visual semantic information rather than dense blocks of text.
High-quality video recording: Videos are recorded by professional speakers in a quiet environment in high definition and cover two modalities: lip-reading video (720p) and slide video (1080p). Speech and lip movements are precisely aligned, and data quality is consistent and reliable.
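The headline figures above imply some rough averages that help when planning storage and data loading. The following is a minimal sketch that uses only the numbers quoted on this page; the function and variable names are illustrative, and because the total duration is given as "about 100 hours", the results are estimates rather than official statistics.

```python
# Figures quoted in the dataset description.
TOTAL_HOURS = 100        # stated as "about 100 hours", so approximate
NUM_CLIPS = 36_208
NUM_SPEAKERS = 207
ARCHIVE_SIZE_GB = 86.64  # listed download size (covers everything in the archive)


def dataset_summary(total_hours, num_clips, num_speakers, size_gb):
    """Derive rough per-clip and per-speaker averages from headline figures."""
    total_seconds = total_hours * 3600
    return {
        "avg_clip_seconds": total_seconds / num_clips,
        "avg_clips_per_speaker": num_clips / num_speakers,
        "avg_minutes_per_speaker": total_hours * 60 / num_speakers,
        "gb_per_hour": size_gb / total_hours,
    }


summary = dataset_summary(TOTAL_HOURS, NUM_CLIPS, NUM_SPEAKERS, ARCHIVE_SIZE_GB)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions, a clip averages just under 10 seconds and each speaker contributes roughly 175 clips, or about half an hour of material. The actual per-speaker distribution is not stated on this page and may be uneven.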
Data distribution

Chinese-LiPS.torrent

Seeding: 1, Downloading: 0, Completed: 69, Total Downloads: 217

Chinese-LiPS/
- README.md
  2.18 KB
- README.txt
  4.37 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Groundsource Global Flood Events Dataset

3 months ago

RubricHub_v1 Multi-Domain Generative Task Dataset

5 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

6 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

6 months ago

X-ray Contraband Detection Dataset

6 months ago

Chinese-LiPS Multimodal Speech Recognition Dataset
