HyperAI

Speech

Speech technology covers the methods computer systems use to process human speech, including speech recognition, synthesis, and understanding. Its goal is to build systems that can interact with people efficiently through spoken language. It is widely used in virtual assistants, customer service systems, and voice translation, where it makes human-computer interaction more natural and convenient.

Speech Dereverberation

6 papers | 5 benchmarks

Spoken language identification

6 papers | 12 benchmarks

Speech Separation

46 papers | 19 benchmarks

Speech Extraction

1 paper | 1 benchmark

Bandwidth Extension

2 papers | 6 benchmarks

Spoken Language Understanding

20 papers | 5 benchmarks

Audio-Visual Speech Recognition

19 papers | 4 benchmarks

Voice Conversion

3 papers | 3 benchmarks

Cultural Vocal Bursts Intensity Prediction

2 papers | 1 benchmark

Dialogue Generation

12 papers | 13 benchmarks

Speech Enhancement

63 papers | 14 benchmarks

Keyword Spotting

53 papers | 10 benchmarks

Speaker Identification

9 papers | 4 benchmarks

Speech Synthesis

19 papers | 5 benchmarks

Distant Speech Recognition

4 papers | 2 benchmarks

Acoustic Unit Discovery

1 paper | 1 benchmark

Vocal Bursts Type Prediction

1 paper | 1 benchmark

Speech Recognition

135 papers | 148 benchmarks

Speaker Verification

12 papers | 12 benchmarks

Speaker Recognition

2 papers | 1 benchmark

Accented Speech Recognition

2 papers | 4 benchmarks

Text-To-Speech Synthesis

14 papers | 6 benchmarks

Story Generation

2 papers | 5 benchmarks

Visual Speech Recognition

2 papers | 2 benchmarks

Multimodal Emotion Recognition

12 papers | 7 benchmarks

Lip to Speech Synthesis

1 paper | 1 benchmark

Noisy Speech Recognition

4 papers | 2 benchmarks

Speech Denoising

1 paper | 2 benchmarks

Arabic Text Diacritization

7 papers | 2 benchmarks

Speaker Diarization

10 papers | 15 benchmarks

Automatic Lyrics Transcription

2 papers | 5 benchmarks

Speech-to-Speech Translation

5 papers | 3 benchmarks

Audio Deepfake Detection

8 papers | 1 benchmark

Spoken Command Recognition

3 papers | 1 benchmark

Phone-level pronunciation scoring

6 papers | 1 benchmark

Automatic Phoneme Recognition

1 paper | 6 benchmarks

Automatic Speech Recognition (ASR)

11 papers | 8 benchmarks

Speech Emotion Recognition

31 papers | 15 benchmarks

Word-level pronunciation scoring

3 papers | 1 benchmark

A-VB High

1 paper | 1 benchmark

Utterance-level pronunciation scoring

3 papers | 1 benchmark

Voice Query Recognition

1 paper | 1 benchmark

A-VB Culture

1 paper | 1 benchmark

A-VB Two

1 paper | 1 benchmark

Speech Synthesis - Gujarati

2 papers | 2 benchmarks

Speech Synthesis - Assamese

1 paper | 1 benchmark

Speech Synthesis - Bengali

1 paper | 1 benchmark

Speech Synthesis - Bodo

1 paper | 1 benchmark

Speech Synthesis - Hindi

1 paper | 1 benchmark

Speech Synthesis - Kannada

1 paper | 1 benchmark

Speech Synthesis - Malayalam

1 paper | 1 benchmark

Speech Synthesis - Manipuri

1 paper | 1 benchmark

Speech Synthesis - Marathi

1 paper | 1 benchmark

Speech Synthesis - Rajasthani

1 paper | 1 benchmark

Speech Synthesis - Tamil

1 paper | 1 benchmark

Speech Synthesis - Telugu

1 paper | 1 benchmark