
Audio

Audio technology covers the techniques for processing, analyzing, and synthesizing sound with computer systems. Its goals are high-quality signal processing and a better listening experience, in support of applications such as speech recognition and audio enhancement. It is widely applied in smart devices, online education, and entertainment, where it improves user experience and makes human-computer interaction more natural and intelligent.

Audio Classification

92 papers | 22 benchmarks

1 paper | 15 benchmarks

Downbeat Tracking

1 paper | 13 benchmarks

Few-Shot Audio Classification

2 papers | 10 benchmarks

Bandwidth Extension

2 papers | 6 benchmarks

Language Identification

5 papers | 6 benchmarks

Acoustic Scene Classification

5 papers | 5 benchmarks

Sound Event Detection

16 papers | 5 benchmarks

Sound Event Localization and Detection

5 papers | 5 benchmarks

Speech Synthesis

19 papers | 5 benchmarks

Audio Super-Resolution

8 papers | 4 benchmarks

Audio Denoising

1 paper | 3 benchmarks

Audio Generation

22 papers | 3 benchmarks

Environmental Sound Classification

3 papers | 3 benchmarks

Instrument Recognition

5 papers | 3 benchmarks

Music Source Separation

26 papers | 3 benchmarks

Target Sound Extraction

2 papers | 3 benchmarks

Voice Anti-spoofing

6 papers | 3 benchmarks

Voice Conversion

3 papers | 3 benchmarks

Audio captioning

19 papers | 2 benchmarks

Audio Source Separation

2 papers | 2 benchmarks

Text-to-Music Generation

15 papers | 2 benchmarks

Zero-shot Audio Captioning

3 papers | 2 benchmarks

Acoustic Novelty Detection

3 papers | 1 benchmark

Active Speaker Localization

1 paper | 1 benchmark

Audio Quality Assessment

1 paper | 1 benchmark

9 papers | 1 benchmark

Cadenza 1 - Task 1 - Headphone

1 paper | 1 benchmark

Cadenza 1 - Task 2 - In Car

1 paper | 1 benchmark

Direction of Arrival Estimation

1 paper | 1 benchmark

Directional Hearing

1 paper | 1 benchmark

fake voice detection

1 paper | 1 benchmark

Lung Sound Classification

3 papers | 1 benchmark

Music Generation

1 paper | 1 benchmark

Real-time Directional Hearing

1 paper | 1 benchmark

Retrieval-augmented Few-shot In-context Audio Captioning

5 papers | 1 benchmark

Streaming Target Sound Extraction

1 paper | 1 benchmark

Video-to-Sound Generation

7 papers | 1 benchmark


© HyperAI

