HyperAI

Natural Language Processing

Métriques de performance des modèles IA grand public sur diverses tâches, mettant en valeur la technologie de pointe

Benchmarks de performance des modèles IA

Métriques de performance des modèles IA grand public sur diverses tâches, mettant en valeur la technologie de pointe

Deep Clustering

50 articles | 5 benchmarks

Semantic Dependency Parsing

50 articles | 3 benchmarks

Word Alignment

50 articles | 7 benchmarks

Few-Shot Text Classification

49 articles | 8 benchmarks

Lemmatization

49 articles | 0 benchmarks

Multimodal Deep Learning

49 articles | 1 benchmarks

Punctuation Restoration

49 articles | 0 benchmarks

Sentence Compression

49 articles | 1 benchmarks

Sentence Ordering

49 articles | 1 benchmarks

Graph-to-Sequence

48 articles | 2 benchmarks

In-Context Learning

48 articles | 0 benchmarks

Relation Extraction

48 articles | 50 benchmarks

Review Generation

48 articles | 0 benchmarks

Rumour Detection

48 articles | 2 benchmarks

Chatbot

47 articles | 1 benchmarks

Dialogue State Tracking

47 articles | 7 benchmarks

Entity Disambiguation

47 articles | 11 benchmarks

Grammatical Error Detection

47 articles | 4 benchmarks

Lexical Normalization

47 articles | 1 benchmarks

Lexical Simplification

47 articles | 0 benchmarks

Semantic Parsing

47 articles | 20 benchmarks

Text Categorization

47 articles | 0 benchmarks

Conversational Response Selection

46 articles | 15 benchmarks

Conversational Search

46 articles | 0 benchmarks

Dialogue Management

46 articles | 0 benchmarks

Document Summarization

46 articles | 7 benchmarks

Goal-Oriented Dialogue Systems

46 articles | 0 benchmarks

Hope Speech Detection

46 articles | 2 benchmarks

Benchmarking

45 articles | 2 benchmarks

Blocking

45 articles | 5 benchmarks

Dependency Parsing

45 articles | 15 benchmarks

Emotion-Cause Pair Extraction

45 articles | 2 benchmarks

Empathetic Response Generation

45 articles | 1 benchmarks

Extractive Text Summarization

45 articles | 5 benchmarks

Generative Question Answering

45 articles | 2 benchmarks

knowledge editing

45 articles | 1 benchmarks

Sentence Embeddings

45 articles | 0 benchmarks

Twitter Sentiment Analysis

45 articles | 0 benchmarks

Decipherment

44 articles | 0 benchmarks

GSM8K

44 articles | 1 benchmarks

Lexical Complexity Prediction

44 articles | 0 benchmarks

Morphological Tagging

44 articles | 0 benchmarks

TAR

44 articles | 0 benchmarks

Text Augmentation

44 articles | 0 benchmarks

Automated Essay Scoring

43 articles | 1 benchmarks

Chinese Word Segmentation

43 articles | 6 benchmarks

Novelty Detection

43 articles | 0 benchmarks

Prompt Engineering

43 articles | 16 benchmarks

Sentence Embedding

43 articles | 0 benchmarks

Sentence Summarization

42 articles | 0 benchmarks

Answer Generation

42 articles | 2 benchmarks

Arabic Sentiment Analysis

42 articles | 0 benchmarks

Cross-Lingual NER

42 articles | 28 benchmarks

Relation Classification

42 articles | 8 benchmarks

Spoken Language Understanding

42 articles | 5 benchmarks

Ad-Hoc Information Retrieval

41 articles | 1 benchmarks

Event Extraction

41 articles | 9 benchmarks

Learning with noisy labels

41 articles | 20 benchmarks

Named Entity Recognition (NER)

41 articles | 77 benchmarks

Reinforcement Learning

41 articles | 21 benchmarks

Safety Alignment

41 articles | 0 benchmarks

Aspect Extraction

40 articles | 6 benchmarks

Dialogue Evaluation

40 articles | 2 benchmarks

Hallucination Evaluation

40 articles | 0 benchmarks

Multimodal Sentiment Analysis

40 articles | 5 benchmarks

Continual Learning

39 articles | 32 benchmarks

Dialogue Generation

39 articles | 13 benchmarks

Distractor Generation

39 articles | 1 benchmarks

Intent Discovery

39 articles | 3 benchmarks

Knowledge Base Population

39 articles | 1 benchmarks

Script Generation

39 articles | 0 benchmarks

Semi-Supervised Text Classification

39 articles | 2 benchmarks

Sequential Pattern Mining

39 articles | 1 benchmarks

Sign Language Production

39 articles | 0 benchmarks

Spelling Correction

39 articles | 0 benchmarks

Text Infilling

39 articles | 0 benchmarks

Abstractive Text Summarization

38 articles | 18 benchmarks

Conversational Question Answering

38 articles | 1 benchmarks

coreference-resolution

38 articles | 0 benchmarks

Dialect Identification

38 articles | 0 benchmarks

Discourse Parsing

38 articles | 4 benchmarks

Discourse Segmentation

38 articles | 0 benchmarks

Document AI

38 articles | 1 benchmarks

Document Classification

38 articles | 21 benchmarks

Entity Alignment

38 articles | 10 benchmarks

Low Resource Named Entity Recognition

38 articles | 3 benchmarks

Self-Learning

38 articles | 0 benchmarks

Text Compression

38 articles | 0 benchmarks

Toxic Spans Detection

38 articles | 0 benchmarks

Emotion Recognition in Conversation

37 articles | 16 benchmarks

Implicit Discourse Relation Classification

37 articles | 0 benchmarks

Recipe Generation

37 articles | 5 benchmarks

Sentence-Pair Classification

37 articles | 0 benchmarks

Speech-to-Text Translation

37 articles | 10 benchmarks

Temporal Relation Extraction

37 articles | 1 benchmarks

Translation

37 articles | 7 benchmarks

Bias Detection

36 articles | 5 benchmarks

Hate Speech Detection

36 articles | 15 benchmarks

Headline Generation

36 articles | 1 benchmarks

Intent Classification

36 articles | 4 benchmarks

Intent Recognition

36 articles | 1 benchmarks

Language Modelling

36 articles | 55 benchmarks

Multilingual Named Entity Recognition

36 articles | 0 benchmarks

Multilingual NLP

36 articles | 0 benchmarks

Phrase Grounding

36 articles | 5 benchmarks

Question Generation

36 articles | 8 benchmarks

Attribute Value Extraction

35 articles | 4 benchmarks

Community Question Answering

35 articles | 2 benchmarks

Emotion Classification

35 articles | 9 benchmarks

Joint Entity and Relation Extraction

35 articles | 16 benchmarks

Query-focused Summarization

35 articles | 0 benchmarks

Text Style Transfer

35 articles | 2 benchmarks

NER

34 articles | 5 benchmarks

Column Type Annotation

34 articles | 12 benchmarks

Image Deblurring

34 articles | 9 benchmarks

Morphological Disambiguation

34 articles | 0 benchmarks

Open-Ended Question Answering

34 articles | 0 benchmarks

Question-Answer-Generation

34 articles | 0 benchmarks

Reverse Dictionary

34 articles | 0 benchmarks

Short Text Clustering

34 articles | 8 benchmarks

Temporal Information Extraction

34 articles | 2 benchmarks

Hypernym Discovery

33 articles | 3 benchmarks

Knowledge Base Question Answering

33 articles | 10 benchmarks

Language Identification

33 articles | 6 benchmarks

Long-range modeling

33 articles | 2 benchmarks

Low Resource NMT

33 articles | 0 benchmarks

Morphological Inflection

33 articles | 0 benchmarks

Native Language Identification

33 articles | 1 benchmarks

Code Repair

32 articles | 1 benchmarks

Document-level Event Extraction

32 articles | 1 benchmarks

Entity Resolution

32 articles | 11 benchmarks

Prepositional Phrase Attachment

32 articles | 0 benchmarks

Suggestion mining

32 articles | 0 benchmarks

Aspect Category Detection

31 articles | 4 benchmarks

Clickbait Detection

31 articles | 0 benchmarks

HellaSwag

31 articles | 0 benchmarks

Passage Re-Ranking

31 articles | 2 benchmarks

Table annotation

31 articles | 0 benchmarks

Cross-Lingual Natural Language Inference

30 articles | 4 benchmarks

Open-Domain Question Answering

30 articles | 15 benchmarks

Unsupervised Extractive Summarization

30 articles | 3 benchmarks

Word Sense Disambiguation

30 articles | 15 benchmarks

Dialogue Understanding

29 articles | 0 benchmarks

Keyphrase Generation

29 articles | 1 benchmarks

LAMBADA

29 articles | 1 benchmarks

Medical Named Entity Recognition

29 articles | 2 benchmarks

Natural Language Inference

29 articles | 37 benchmarks

Nested Named Entity Recognition

29 articles | 6 benchmarks

Part-Of-Speech Tagging

29 articles | 15 benchmarks

Reading Comprehension

29 articles | 7 benchmarks

Argument Mining

28 articles | 1 benchmarks

Coherence Evaluation

28 articles | 2 benchmarks

Implicatures

28 articles | 1 benchmarks

multimodal generation

28 articles | 1 benchmarks

News Generation

28 articles | 0 benchmarks

Stance Detection

28 articles | 22 benchmarks

Active Learning

27 articles | 1 benchmarks

Aggression Identification

27 articles | 0 benchmarks

Definition Extraction

27 articles | 0 benchmarks

Drug–drug Interaction Extraction

27 articles | 3 benchmarks

Emotion Cause Extraction

27 articles | 1 benchmarks

Entity Extraction using GAN

27 articles | 0 benchmarks

Entity Linking

27 articles | 27 benchmarks

Legal Reasoning

27 articles | 2 benchmarks

Mamba

27 articles | 0 benchmarks

Question Selection

27 articles | 1 benchmarks

Temporal Relation Classification

27 articles | 4 benchmarks

Toxic Comment Classification

27 articles | 4 benchmarks

Transliteration

27 articles | 0 benchmarks

Word Similarity

27 articles | 1 benchmarks

Decoder

26 articles | 0 benchmarks

Opinion Mining

26 articles | 1 benchmarks

Pretrained Multilingual Language Models

26 articles | 0 benchmarks

Question Rewriting

26 articles | 0 benchmarks

Table-based Fact Verification

26 articles | 1 benchmarks

Abstract Argumentation

25 articles | 0 benchmarks

Cross-Lingual Document Classification

25 articles | 10 benchmarks

Cross-Lingual Question Answering

25 articles | 3 benchmarks

Deep Learning

25 articles | 0 benchmarks

Diachronic Word Embeddings

25 articles | 0 benchmarks

Event Causality Identification

25 articles | 0 benchmarks

Low-Resource Neural Machine Translation

25 articles | 1 benchmarks

Protein Folding

25 articles | 0 benchmarks

Timeline Summarization

25 articles | 1 benchmarks

Automatic Post-Editing

24 articles | 0 benchmarks

CCG Supertagging

24 articles | 1 benchmarks

Coreference Resolution

24 articles | 16 benchmarks

Literature Mining

24 articles | 0 benchmarks

Method name prediction

24 articles | 1 benchmarks

Topic Models

24 articles | 6 benchmarks

Unsupervised Dependency Parsing

24 articles | 1 benchmarks

Chinese Named Entity Recognition

23 articles | 7 benchmarks

Emotional Intelligence

23 articles | 1 benchmarks

Few-Shot Relation Classification

23 articles | 4 benchmarks

Image to Video Generation

23 articles | 0 benchmarks

Semantic Retrieval

23 articles | 1 benchmarks

Taxonomy Expansion

23 articles | 0 benchmarks

Text-to-Image Generation

23 articles | 13 benchmarks

Text-To-SQL

23 articles | 10 benchmarks

Winogrande

23 articles | 0 benchmarks

Abuse Detection

22 articles | 0 benchmarks

Cross-Lingual Entity Linking

22 articles | 0 benchmarks

Data-free Knowledge Distillation

22 articles | 2 benchmarks

Dialog Act Classification

22 articles | 1 benchmarks

Extract Aspect

22 articles | 1 benchmarks

Extreme Summarization

22 articles | 4 benchmarks

Scientific Document Summarization

22 articles | 1 benchmarks

Short-Text Conversation

22 articles | 0 benchmarks

Table Retrieval

22 articles | 1 benchmarks

Text Retrieval

22 articles | 16 benchmarks

Word Translation

22 articles | 0 benchmarks

Cloze Test

21 articles | 2 benchmarks

Constituency Grammar Induction

21 articles | 1 benchmarks

Conversational Response Generation

21 articles | 0 benchmarks

Cross Document Coreference Resolution

21 articles | 0 benchmarks

KG-to-Text Generation

21 articles | 11 benchmarks

Large Language Model

21 articles | 2 benchmarks

Linguistic Acceptability

21 articles | 5 benchmarks

Opinion Summarization

21 articles | 0 benchmarks

Passage Ranking

21 articles | 1 benchmarks

Text Clustering

21 articles | 3 benchmarks

Zero-shot Slot Filling

21 articles | 3 benchmarks

Dependency Grammar Induction

20 articles | 2 benchmarks

Entity Typing

20 articles | 8 benchmarks

Intent Detection

20 articles | 19 benchmarks

Key Information Extraction

20 articles | 6 benchmarks

LLM-generated Text Detection

20 articles | 0 benchmarks

Paraphrase Identification

20 articles | 11 benchmarks

Probing Language Models

20 articles | 1 benchmarks

Specificity

20 articles | 0 benchmarks

Text Anonymization

20 articles | 0 benchmarks

Cross-Domain Named Entity Recognition

19 articles | 1 benchmarks

Dynamic Topic Modeling

19 articles | 0 benchmarks

Explanation Generation

19 articles | 5 benchmarks

Fine-Grained Opinion Analysis

19 articles | 1 benchmarks

Formality Style Transfer

19 articles | 1 benchmarks

Linguistic steganography

19 articles | 0 benchmarks

Low Resource Neural Machine Translation

19 articles | 0 benchmarks

Multi-Hop Reading Comprehension

19 articles | 0 benchmarks

Multi-Label Text Classification

19 articles | 20 benchmarks

News Classification

19 articles | 4 benchmarks

Relationship Extraction (Distant Supervised)

19 articles | 2 benchmarks

text annotation

19 articles | 0 benchmarks

Text-to-Video Generation

19 articles | 6 benchmarks

Toponym Resolution

19 articles | 0 benchmarks

XLM-R

19 articles | 0 benchmarks

Aspect Category Sentiment Analysis

18 articles | 1 benchmarks

Component Classification

18 articles | 1 benchmarks

Data-to-Text Generation

18 articles | 26 benchmarks

Event Relation Extraction

18 articles | 0 benchmarks

Language Acquisition

18 articles | 1 benchmarks

Story Generation

18 articles | 5 benchmarks

Answer Selection

17 articles | 6 benchmarks

Chinese Spell Checking

17 articles | 1 benchmarks

Complex Word Identification

17 articles | 0 benchmarks

Concept-To-Text Generation

17 articles | 1 benchmarks

De-identification

17 articles | 0 benchmarks

Gender Bias Detection

17 articles | 0 benchmarks

Memorization

17 articles | 1 benchmarks

nlg evaluation

17 articles | 0 benchmarks

POS Tagging

17 articles | 2 benchmarks

Semantic Role Labeling

17 articles | 7 benchmarks

Topic coverage

17 articles | 3 benchmarks

Vietnamese Datasets

17 articles | 0 benchmarks

Visual Dialog

17 articles | 8 benchmarks

Zero-Shot Stance Detection

17 articles | 0 benchmarks

AMR Parsing

16 articles | 8 benchmarks

Citation Intent Classification

16 articles | 2 benchmarks

Conditional Text Generation

16 articles | 1 benchmarks

Cross-Lingual Information Retrieval

16 articles | 0 benchmarks

Embeddings Evaluation

16 articles | 0 benchmarks

Fake News Detection

16 articles | 10 benchmarks

Keyword Extraction

16 articles | 3 benchmarks

Relational Reasoning

16 articles | 1 benchmarks

Semantic Textual Similarity

16 articles | 13 benchmarks

Story Completion

16 articles | 0 benchmarks

Table-to-Text Generation

16 articles | 8 benchmarks

Text Summarization

16 articles | 37 benchmarks

Transition-Based Dependency Parsing

16 articles | 0 benchmarks

Zero-Shot Text-to-Image Generation

16 articles | 0 benchmarks

Abstract Meaning Representation

15 articles | 0 benchmarks

Action Parsing

15 articles | 1 benchmarks

Aspect-Based Sentiment Analysis (ABSA)

15 articles | 18 benchmarks

Authorship Verification

15 articles | 0 benchmarks

Continual Relation Extraction

15 articles | 0 benchmarks

Dialogue Act Classification

15 articles | 5 benchmarks

Language Modeling

15 articles | 0 benchmarks

Machine Translation

15 articles | 83 benchmarks

PICO

15 articles | 1 benchmarks

Polyphone disambiguation

15 articles | 1 benchmarks

Prosody Prediction

15 articles | 1 benchmarks

Question Answering

15 articles | 149 benchmarks

Temporal Tagging

15 articles | 8 benchmarks

Aspect Term Extraction and Sentiment Classification

14 articles | 1 benchmarks

Cross-Domain Text Classification

14 articles | 0 benchmarks

Dialog Relation Extraction

14 articles | 2 benchmarks

Fact Selection

14 articles | 1 benchmarks

Implicit Relations

14 articles | 1 benchmarks

Key Point Matching

14 articles | 0 benchmarks

Profile Generation

14 articles | 1 benchmarks

Semantic entity labeling

14 articles | 2 benchmarks

Spam detection

14 articles | 1 benchmarks

Table-based Question Answering

14 articles | 0 benchmarks

Table Search

14 articles | 0 benchmarks

Text Generation

14 articles | 71 benchmarks

Automated Writing Evaluation

13 articles | 0 benchmarks

Cell Entity Annotation

13 articles | 5 benchmarks

Comment Generation

13 articles | 0 benchmarks

Commonsense Causal Reasoning

13 articles | 0 benchmarks

DRS Parsing

13 articles | 2 benchmarks

Extractive Summarization

13 articles | 0 benchmarks

Few-shot NER

13 articles | 4 benchmarks

Long-Context Understanding

13 articles | 5 benchmarks

Model Editing

13 articles | 0 benchmarks

Parallel Corpus Mining

13 articles | 0 benchmarks

Persian Sentiment Analysis

13 articles | 0 benchmarks

RAG

13 articles | 0 benchmarks

Text Classification

13 articles | 85 benchmarks

UCCA Parsing

13 articles | 2 benchmarks

Arabic Text Diacritization

12 articles | 2 benchmarks

Causal Emotion Entailment

12 articles | 1 benchmarks

Conversation Disentanglement

12 articles | 3 benchmarks

Humor Detection

12 articles | 1 benchmarks

Key-value Pair Extraction

12 articles | 2 benchmarks

Negation Scope Resolution

12 articles | 4 benchmarks

Predicate Detection

12 articles | 3 benchmarks

Relevance Detection

12 articles | 0 benchmarks

Sentence Pair Modeling

12 articles | 0 benchmarks

Session Search

12 articles | 0 benchmarks

Simultaneous Speech-to-Text Translation

12 articles | 0 benchmarks

Unsupervised Text Classification

12 articles | 4 benchmarks

Author Attribution

11 articles | 0 benchmarks

Columns Property Annotation

11 articles | 4 benchmarks

End-To-End Dialogue Modelling

11 articles | 2 benchmarks

Hint Generation

11 articles | 0 benchmarks

Mathematical Question Answering

11 articles | 2 benchmarks

Multiple Choice Question Answering (MCQA)

11 articles | 31 benchmarks

Nested Mention Recognition

11 articles | 2 benchmarks

Paper generation

11 articles | 2 benchmarks

Passage Retrieval

11 articles | 6 benchmarks

Question Similarity

11 articles | 1 benchmarks

Satire Detection

11 articles | 0 benchmarks

Subjectivity Analysis

11 articles | 2 benchmarks

Toponym Recognition

11 articles | 0 benchmarks

Vietnamese Word Segmentation

11 articles | 0 benchmarks

Zero-Shot Cross-Lingual Transfer

11 articles | 2 benchmarks

Zero-shot Named Entity Recognition (NER)

11 articles | 4 benchmarks

Abusive Language

10 articles | 0 benchmarks

Chunking

10 articles | 5 benchmarks

Cross-Lingual Semantic Textual Similarity

10 articles | 0 benchmarks

Document Ranking

10 articles | 2 benchmarks

Lay Summarization

10 articles | 2 benchmarks

Multi-modal Named Entity Recognition

10 articles | 5 benchmarks

Natural Language Understanding

10 articles | 6 benchmarks

Open-Domain Dialog

10 articles | 1 benchmarks

Semantic Composition

10 articles | 0 benchmarks

Semantic Shift Detection

10 articles | 0 benchmarks

Simultaneous Speech-to-Speech Translation

10 articles | 0 benchmarks

Only Connect Walls Dataset Task 1 (Grouping)

10 articles | 1 benchmarks

Text Simplification

10 articles | 11 benchmarks

Variable Detection

10 articles | 1 benchmarks

Zero-shot Event Extraction

10 articles | 0 benchmarks

AI Agent

9 articles | 0 benchmarks

answerability prediction

9 articles | 1 benchmarks

Binary Relation Extraction

9 articles | 2 benchmarks

Bridging Anaphora Resolution

9 articles | 0 benchmarks

Chinese Zero Pronoun Resolution

9 articles | 0 benchmarks

Connective Detection

9 articles | 0 benchmarks

Document Dating

9 articles | 2 benchmarks

Image-guided Story Ending Generation

9 articles | 2 benchmarks

molecular representation

9 articles | 0 benchmarks

Response Generation

9 articles | 3 benchmarks

Sentiment Analysis

9 articles | 42 benchmarks

Unsupervised Opinion Summarization

9 articles | 3 benchmarks

Vietnamese Social Media Text Processing

9 articles | 0 benchmarks

Author Profiling

8 articles | 0 benchmarks

Belebele

8 articles | 0 benchmarks

Bilingual Lexicon Induction

8 articles | 0 benchmarks

Cross-Lingual Word Embeddings

8 articles | 0 benchmarks

Definition Modelling

8 articles | 0 benchmarks

Dialog Learning

8 articles | 0 benchmarks

Emotion Recognition in Context

8 articles | 4 benchmarks

Grammatical Error Correction

8 articles | 13 benchmarks

Handwritten Chinese Text Recognition

8 articles | 0 benchmarks

Multi-agent Integration

8 articles | 1 benchmarks

Offline Handwritten Chinese Character Recognition

8 articles | 0 benchmarks

Paraphrase Generation

8 articles | 3 benchmarks

Sarcasm Detection

8 articles | 9 benchmarks

Spatial Reasoning

8 articles | 2 benchmarks

Summarization

8 articles | 12 benchmarks

target-oriented opinion words extraction

8 articles | 0 benchmarks

Thai Word Segmentation

8 articles | 2 benchmarks

Unsupervised Sentence Summarization

8 articles | 0 benchmarks

User Simulation

8 articles | 0 benchmarks

Vietnamese Hate Speech Detection

8 articles | 0 benchmarks

WNLI

8 articles | 0 benchmarks

Zero-Shot Machine Translation

8 articles | 0 benchmarks

Aspect-oriented Opinion Extraction

7 articles | 1 benchmarks

Code Documentation Generation

7 articles | 7 benchmarks

Contextualised Word Representations

7 articles | 0 benchmarks

Dialogue Rewriting

7 articles | 3 benchmarks

Few-Shot Stance Detection

7 articles | 0 benchmarks

Image Segmentation

7 articles | 12 benchmarks

Japanese Word Segmentation

7 articles | 1 benchmarks

Meme Classification

7 articles | 3 benchmarks

Occupation prediction

7 articles | 0 benchmarks

Open Intent Discovery

7 articles | 6 benchmarks

Privacy Preserving Deep Learning

7 articles | 0 benchmarks

Propaganda detection

7 articles | 0 benchmarks

Propaganda span identification

7 articles | 0 benchmarks

Query-Based Extractive Summarization

7 articles | 1 benchmarks

Slot Filling

7 articles | 14 benchmarks

SNARKS

7 articles | 0 benchmarks

Text Attribute Transfer

7 articles | 0 benchmarks

Timex normalization

7 articles | 2 benchmarks

Vietnamese Visual Question Answering

7 articles | 0 benchmarks

Word Sense Induction

7 articles | 1 benchmarks

Aspect-Category-Opinion-Sentiment Quadruple Extraction

6 articles | 2 benchmarks

Aspect Category Polarity

6 articles | 1 benchmarks

Cognate Prediction

6 articles | 0 benchmarks

Cross-Lingual Bitext Mining

6 articles | 4 benchmarks

Deep Attention

6 articles | 0 benchmarks

Equation Discovery

6 articles | 0 benchmarks

Fact Verification

6 articles | 3 benchmarks

Grounded language learning

6 articles | 0 benchmarks

Information Retrieval

6 articles | 34 benchmarks

Math Word Problem Solving

6 articles | 13 benchmarks

Mathematical Reasoning

6 articles | 11 benchmarks

Morpheme Segmentaiton

6 articles | 1 benchmarks

News Annotation

6 articles | 0 benchmarks

Open Intent Detection

6 articles | 17 benchmarks

Selection bias

6 articles | 0 benchmarks

Syntax Representation

6 articles | 0 benchmarks

Task-Completion Dialogue Policy Learning

6 articles | 0 benchmarks

Temporal/Casual QA

6 articles | 1 benchmarks

Term Extraction

6 articles | 2 benchmarks

text-to-Cypher

6 articles | 0 benchmarks

Vietnamese Language Models

6 articles | 0 benchmarks

Zero-shot Sentiment Classification

6 articles | 1 benchmarks

Argument Pair Extraction (APE)

5 articles | 1 benchmarks

Binary Condescension Detection

5 articles | 1 benchmarks

Continual Named Entity Recognition

5 articles | 0 benchmarks

Cross-Lingual Transfer

5 articles | 1 benchmarks

Dialogue Interpretation

5 articles | 0 benchmarks

Drug Design

5 articles | 0 benchmarks

DrugProt

5 articles | 1 benchmarks

Job classification

5 articles | 0 benchmarks

Job Prediction

5 articles | 0 benchmarks

Lexical Analysis

5 articles | 0 benchmarks

Long Form Question Answering

5 articles | 0 benchmarks

Multi-label Condescension Detection

5 articles | 1 benchmarks

Multimodal Machine Translation

5 articles | 3 benchmarks

Named Entity Recognition In Vietnamese

5 articles | 2 benchmarks

Personality Alignment

5 articles | 0 benchmarks

Reading Order Detection

5 articles | 2 benchmarks

Riddle Sense

5 articles | 2 benchmarks

Scientific Results Extraction

5 articles | 2 benchmarks

Stereotypical Bias Analysis

5 articles | 1 benchmarks

Text Effects Transfer

5 articles | 0 benchmarks

Unsupervised Part-Of-Speech Tagging

5 articles | 0 benchmarks

Vietnamese Image Captioning

5 articles | 0 benchmarks

Zero-shot Relation Triplet Extraction

5 articles | 2 benchmarks

Abstract Anaphora Resolution

4 articles | 1 benchmarks

Attribute Mining

4 articles | 3 benchmarks

Authorship Attribution

4 articles | 0 benchmarks

Bangla Spelling Error Correction

4 articles | 1 benchmarks

Chemical Indexing

4 articles | 1 benchmarks

Class-level Code Generation

4 articles | 1 benchmarks

Cross-lingual zero-shot dependency parsing

4 articles | 1 benchmarks

Chinese Spelling Error Correction

4 articles | 0 benchmarks

Document-level Relation Extraction

4 articles | 3 benchmarks

Emotional Dialogue Acts

4 articles | 0 benchmarks

Empirical Judgments

4 articles | 1 benchmarks

Extracting COVID-19 Events from Twitter

4 articles | 1 benchmarks

Face Selection

4 articles | 0 benchmarks

Goal-Oriented Dialog

4 articles | 1 benchmarks

Hope Speech Detection for Tamil

4 articles | 1 benchmarks

Information Threading

4 articles | 2 benchmarks

Instruction Following

4 articles | 1 benchmarks

Interactive Evaluation of Dialog

4 articles | 1 benchmarks

Joint Multilingual Sentence Representations

4 articles | 0 benchmarks

Logical Reasoning Question Answering

4 articles | 1 benchmarks

Logical Reasoning Reading Comprehension

4 articles | 0 benchmarks

Misogynistic Aggression Identification

4 articles | 0 benchmarks

Multimodal Attribute Value Extraction

4 articles | 0 benchmarks

Open Information Extraction

4 articles | 13 benchmarks

Page Stream Segmentation

4 articles | 0 benchmarks

Personality Generation

4 articles | 0 benchmarks

Reliable Intelligence Identification

4 articles | 0 benchmarks

Semantic Role Labeling (predicted predicates)

4 articles | 2 benchmarks

Speculation Detection

4 articles | 0 benchmarks

Text-Based Stock Prediction

4 articles | 0 benchmarks

Text-to-video search

4 articles | 0 benchmarks

Timedial

4 articles | 1 benchmarks

Twitter Event Detection

4 articles | 1 benchmarks

Unsupervised Sentence Compression

4 articles | 0 benchmarks

Unsupervised semantic parsing

4 articles | 2 benchmarks

Vietnamese Fact Checking

4 articles | 0 benchmarks

Vietnamese Speech Recognition

4 articles | 0 benchmarks

AI and Safety

3 articles | 0 benchmarks

Aspect Category Sentiment Classification

3 articles | 0 benchmarks

Aspect-Sentiment-Opinion Triplet Extraction

3 articles | 1 benchmarks

Constituency Parsing

3 articles | 4 benchmarks

Conversational Web Navigation

3 articles | 1 benchmarks

Dark Humor Detection

3 articles | 1 benchmarks

Data Mining

3 articles | 0 benchmarks

Dialogue Safety Prediction

3 articles | 2 benchmarks

Disambiguation QA

3 articles | 0 benchmarks

Discourse Marker Prediction

3 articles | 1 benchmarks

Domain Labelling

3 articles | 1 benchmarks

End-to-End RST Parsing

3 articles | 1 benchmarks

English Proverbs

3 articles | 1 benchmarks

Extract aspect-polarity tuple

3 articles | 1 benchmarks

Few-shot HTC

3 articles | 0 benchmarks

Formal Fallacies Syllogisms Negation

3 articles | 0 benchmarks

Hate Speech Normalization

3 articles | 0 benchmarks

Hyperbaton

3 articles | 0 benchmarks

image-sentence alignment

3 articles | 12 benchmarks

Information Extraction

3 articles | 1 benchmarks

KB-to-Language Generation

3 articles | 1 benchmarks

Meme Captioning

3 articles | 0 benchmarks

Memex Question Answering

3 articles | 1 benchmarks

Multi-modal Dialogue Generation

3 articles | 1 benchmarks

Negation Detection

3 articles | 0 benchmarks

Personality Recognition in Conversation

3 articles | 1 benchmarks

Phrase Ranking

3 articles | 2 benchmarks

Phrase Relatedness

3 articles | 1 benchmarks

Phrase Tagging

3 articles | 2 benchmarks

Political Salient Issue Orientation Detection

3 articles | 1 benchmarks

Poll Generation

3 articles | 1 benchmarks

Recognizing Emotion Cause in Conversations

3 articles | 2 benchmarks

Record linking

3 articles | 0 benchmarks

Relational Captioning

3 articles | 1 benchmarks

Ruin Names

3 articles | 0 benchmarks

Sentence Classification

3 articles | 6 benchmarks

Sentence Embeddings For Biomedical Texts

3 articles | 2 benchmarks

Social Media Mental Health Detection

3 articles | 0 benchmarks

Sonnet Generation

3 articles | 0 benchmarks

Speculation Scope Resolution

3 articles | 3 benchmarks

Turning Point Identification

3 articles | 0 benchmarks

Vietnamese Aspect-Based Sentiment Analysis

3 articles | 0 benchmarks

Vietnamese Natural Language Understanding

3 articles | 0 benchmarks

Vietnamese Scene Text

3 articles | 0 benchmarks

Vietnamese Sentiment Analysis

3 articles | 0 benchmarks

4-ary Relation Extraction

2 articles | 1 benchmarks

ArabicMMLU

2 articles | 0 benchmarks

Automatic Writing

2 articles | 0 benchmarks

Claim-Evidence Pair Extraction (CEPE)

2 articles | 1 benchmarks

Claim Extraction with Stance Classification (CESC)

2 articles | 1 benchmarks

Clinical Information Retreival

2 articles | 0 benchmarks

Clinical Language Translation

2 articles | 0 benchmarks

Clinical Section Identification

2 articles | 1 benchmarks

Collaborative Plan Acquisition

2 articles | 0 benchmarks

Context Query Reformulation

2 articles | 0 benchmarks

Croatian Text Diacritization

2 articles | 1 benchmarks

Cross-lingual Text-to-Image Generation

2 articles | 0 benchmarks

Czech Text Diacritization

2 articles | 1 benchmarks

Description-guided molecule generation

2 articles | 1 benchmarks

Document-level Closed Information Extraction

2 articles | 3 benchmarks

Document-level RE with incomplete labeling

2 articles | 2 benchmarks

Email Thread Summarization

2 articles | 2 benchmarks

Event-Driven Trading

2 articles | 0 benchmarks

Fantasy Reasoning

2 articles | 1 benchmarks

few-shot-htc

2 articles | 0 benchmarks

Figure Of Speech Detection

2 articles | 1 benchmarks

French Text Diacritization

2 articles | 1 benchmarks

GRE Reading Comprehension

2 articles | 1 benchmarks

Hate Span Identification

2 articles | 0 benchmarks

Hidden Aspect Detection

2 articles | 0 benchmarks

Hierarchical Text Classification of Blurbs (GermEval 2019)

2 articles | 1 benchmarks

Hierarchical Text Clustering

2 articles | 0 benchmarks

Hope Speech Detection for English

2 articles | 1 benchmarks

Hope Speech Detection for Malayalam

2 articles | 1 benchmarks

Hungarian Text Diacritization

2 articles | 1 benchmarks

Hyper-Relational Extraction

2 articles | 1 benchmarks

Image-to-Text Retrieval

2 articles | 8 benchmarks

incongruity detection

2 articles | 0 benchmarks

Intrusion Detection

2 articles | 5 benchmarks

Irish Text Diacritization

2 articles | 1 benchmarks

Irony Identification

2 articles | 1 benchmarks

Keyphrase Extraction

2 articles | 6 benchmarks

Latvian Text Diacritization

2 articles | 1 benchmarks

legal outcome extraction

2 articles | 0 benchmarks

Machine Reading Comprehension

2 articles | 4 benchmarks

Math Information Retrieval

2 articles | 1 benchmarks

Molecular description generation

2 articles | 0 benchmarks

Movie Dialog Same Or Different

2 articles | 1 benchmarks

Multi-Document Summarization

2 articles | 5 benchmarks

Multi-lingual Text-to-Image Generation

2 articles | 0 benchmarks

multilingual cross-modal retrieval

2 articles | 0 benchmarks

Multilingual Paraphrase Generation

2 articles | 0 benchmarks

Multimodal Abstractive Text Summarization

2 articles | 1 benchmarks

Multimodal Lexical Translation

2 articles | 4 benchmarks

Natural Language Transduction

2 articles | 0 benchmarks

Negation and Speculation Cue Detection

2 articles | 2 benchmarks

Negation and Speculation Scope resolution

2 articles | 0 benchmarks

Nonsense Words Grammar

2 articles | 1 benchmarks

Open Relation Modeling

2 articles | 0 benchmarks

Personalized and Emotional Conversation

2 articles | 1 benchmarks

Political evalutation

2 articles | 0 benchmarks

RACE-h

2 articles | 1 benchmarks

RACE-m

2 articles | 1 benchmarks

Reader-Aware Summarization

2 articles | 1 benchmarks

Role-filler Entity Extraction

2 articles | 1 benchmarks

Romanian Text Diacritization

2 articles | 1 benchmarks

Scientific Concept Extraction

2 articles | 1 benchmarks

Semantic Similarity

2 articles | 26 benchmarks

SemEval-2022 Task 4-1 (Binary PCL Detection)

2 articles | 1 benchmarks

SemEval-2022 Task 4-2 (Multi-label PCL Detection)

2 articles | 1 benchmarks

Semi-Supervised Text Regression

2 articles | 0 benchmarks

Sensitivity Classification

2 articles | 1 benchmarks

Sentiment Dependency Learning

2 articles | 0 benchmarks

Sketch-to-text Generation

2 articles | 0 benchmarks

Slovak Text Diacritization

2 articles | 1 benchmarks

Spanish Text Diacritization

2 articles | 1 benchmarks

SSTOD

2 articles | 2 benchmarks

Task-Oriented Dialogue Systems

2 articles | 4 benchmarks

Text Matching

2 articles | 0 benchmarks

Text-to-GQL

2 articles | 0 benchmarks

Text-Variation

2 articles | 0 benchmarks

Textual Analogy Parsing

2 articles | 0 benchmarks

True or False Question Answering

2 articles | 0 benchmarks

trustable and focussed LLM generated content

2 articles | 0 benchmarks

Turkish Text Diacritization

2 articles | 1 benchmarks

Understanding Fables

2 articles | 1 benchmarks

Unsupervised KG-to-Text Generation

2 articles | 4 benchmarks

Unsupervised Machine Translation

2 articles | 9 benchmarks

ValNov

2 articles | 2 benchmarks

Vietnamese Parsing

2 articles | 0 benchmarks

Vietnamese Text Diacritization

2 articles | 1 benchmarks

Visual Commonsense Tests

2 articles | 1 benchmarks

Workflow Discovery

2 articles | 1 benchmarks

Alignement visualisation

1 articles | 0 benchmarks

Anaphora Resolution

1 articles | 0 benchmarks

ARQMath2

1 articles | 0 benchmarks

Aspect Sentiment Triplet Extraction

1 articles | 4 benchmarks

Bangla Text Detection

1 articles | 1 benchmarks

Blackout Poetry Generation

1 articles | 1 benchmarks

Catalog Extraction

1 articles | 1 benchmarks

Cause-Effect Relation Classification

1 articles | 0 benchmarks

Chinese

1 articles | 0 benchmarks

Clinical Assertion Status Detection

1 articles | 1 benchmarks

Coding Problem Tagging

1 articles | 0 benchmarks

Commonsense Reasoning for RL

1 articles | 1 benchmarks

Complaint Comment Classification

1 articles | 0 benchmarks

Context-specific Spam Detection

1 articles | 1 benchmarks

Contextualized Literature-based Discovery

1 articles | 0 benchmarks

Controllable Language Modelling

1 articles | 0 benchmarks

Conversational Sentiment Quadruple Extraction

1 articles | 2 benchmarks

Counterspeech Detection

1 articles | 1 benchmarks

Cross-Document Language Modeling

1 articles | 2 benchmarks

Cross-Language Text Summarization

1 articles | 0 benchmarks

Cross-Lingual

1 articles | 0 benchmarks

Crowdsourced Text Aggregation

1 articles | 2 benchmarks

Detection of potentially void clauses

1 articles | 1 benchmarks

Dialogue

1 articles | 1 benchmarks

Direct NMT

1 articles | 0 benchmarks

Emergent communications on relations

1 articles | 0 benchmarks

Emotion Detection and Trigger Summarization

1 articles | 0 benchmarks

Entity Typing on DH-KGs

1 articles | 0 benchmarks

Extractive Tags Summarization

1 articles | 0 benchmarks

Fact-based Text Editing

1 articles | 2 benchmarks

FG-1-PG-1

1 articles | 3 benchmarks

Figurative Language Visualization

1 articles | 0 benchmarks

Genetic IE

1 articles | 0 benchmarks

GermEval2024 Shared Task 1 Subtask 1

1 articles | 1 benchmarks

GermEval2024 Shared Task 1 Subtask 2

1 articles | 1 benchmarks

Grapheme Detection

1 articles | 0 benchmarks

Grounded Open Vocabulary Acquisition

1 articles | 0 benchmarks

Hate Intensity Prediction

1 articles | 0 benchmarks

Hate Speech Detection CrisisHateMM Benchmark

1 articles | 0 benchmarks

Hurtful Sentence Completion

1 articles | 1 benchmarks

Joint Entity and Relation Extraction on Scientific Data

1 articles | 0 benchmarks

Joint NER and Classification

1 articles | 0 benchmarks

Latent Aspect Detection

1 articles | 0 benchmarks

Legal Document Translation

1 articles | 0 benchmarks

Line Items Extraction

1 articles | 0 benchmarks

Link prediction on DH-KGs

1 articles | 1 benchmarks

Medical question pair similarity computation

1 articles | 0 benchmarks

Meeting Summarization

1 articles | 2 benchmarks

Metric-Type Identification

1 articles | 0 benchmarks

MMSQL performance

1 articles | 1 benchmarks

Morphological Analysis

1 articles | 0 benchmarks

Multi-Dialect Vietnamese

1 articles | 0 benchmarks

Multi-Grained Named Entity Recognition

1 articles | 0 benchmarks

Multi-Labeled Relation Extraction

1 articles | 0 benchmarks

multi-word expression embedding

1 articles | 0 benchmarks

multi-word expression sememe prediction

1 articles | 0 benchmarks

Multilingual Machine Comprehension in English Hindi

1 articles | 1 benchmarks

Multimedia Generative Script Learning

1 articles | 0 benchmarks

Multimodal GIF Dialog

1 articles | 1 benchmarks

Multimodal Text Prediction

1 articles | 1 benchmarks

Multiview Contextual Commonsense Inference

1 articles | 2 benchmarks

Multlingual Neural Machine Translation

1 articles | 0 benchmarks

Natural Language Landmark Navigation Instructions Generation

1 articles | 1 benchmarks

Open-World Social Event Classification

1 articles | 0 benchmarks

Overlapping Mention Recognition

1 articles | 0 benchmarks

Pcl Detection

1 articles | 0 benchmarks

Persona Dialogue in Story

1 articles | 1 benchmarks

Phrase Vector Embedding

1 articles | 0 benchmarks

Poem meters classification

1 articles | 1 benchmarks

Problem-Solving Deliberation

1 articles | 1 benchmarks

Pronunciation Dictionary Creation

1 articles | 0 benchmarks

Propaganda technique identification

1 articles | 0 benchmarks

quantum circuit classification (classical ML)

1 articles | 0 benchmarks

Query Wellformedness

1 articles | 1 benchmarks

Question-Answer categorization

1 articles | 1 benchmarks

Question Quality Assessment

1 articles | 2 benchmarks

Question to Declarative Sentence

1 articles | 0 benchmarks

Readability optimization

1 articles | 0 benchmarks

relation explanation

1 articles | 0 benchmarks

Relation Mention Extraction

1 articles | 0 benchmarks

Row Annotation

1 articles | 1 benchmarks

Rules-of-thumb Generation

1 articles | 0 benchmarks

Semi-Supervised Formality Style Transfer

1 articles | 0 benchmarks

Speaker Attribution in German Parliamentary Debates (GermEval 2023, subtask 1)

1 articles | 1 benchmarks

Stance Detection (US Election 2020 - Biden)

1 articles | 1 benchmarks

Stance Detection (US Election 2020 - Trump)

1 articles | 1 benchmarks

Summarization Consistency Evaluation

1 articles | 1 benchmarks

Table Type Detection

1 articles | 1 benchmarks

Only Connect Walls Dataset Task 2 (Connections)

1 articles | 0 benchmarks

Taxonomy Learning

1 articles | 0 benchmarks

Text-to-CQL

1 articles | 0 benchmarks

Traditional Spam Detection

1 articles | 1 benchmarks

Tweet-Reply Sentiment Analysis

1 articles | 1 benchmarks

Variable Disambiguation

1 articles | 1 benchmarks

Vietnamese Lexical Normalization

1 articles | 0 benchmarks

Vietnamese Multimodal Sentiment Analysis

1 articles | 0 benchmarks

Visual Storytelling

1 articles | 1 benchmarks

Weakly Supervised Data Denoising

1 articles | 0 benchmarks

Web Page Tagging

1 articles | 0 benchmarks

Word Attribute Transfer

1 articles | 0 benchmarks

Zero-Shot Out-of-Domain Detection

1 articles | 0 benchmarks