Natural Language Processing
Métriques de performance des modèles IA grand public sur diverses tâches, mettant en valeur la technologie de pointe
Benchmarks de performance des modèles IA
Métriques de performance des modèles IA grand public sur diverses tâches, mettant en valeur la technologie de pointe
Deep Clustering
50 articles | 5 benchmarks
Semantic Dependency Parsing
50 articles | 3 benchmarks
Word Alignment
50 articles | 7 benchmarks
Few-Shot Text Classification
49 articles | 8 benchmarks
Lemmatization
49 articles | 0 benchmarks
Multimodal Deep Learning
49 articles | 1 benchmarks
Punctuation Restoration
49 articles | 0 benchmarks
Sentence Compression
49 articles | 1 benchmarks
Sentence Ordering
49 articles | 1 benchmarks
Graph-to-Sequence
48 articles | 2 benchmarks
In-Context Learning
48 articles | 0 benchmarks
Relation Extraction
48 articles | 50 benchmarks
Review Generation
48 articles | 0 benchmarks
Rumour Detection
48 articles | 2 benchmarks
Chatbot
47 articles | 1 benchmarks
Dialogue State Tracking
47 articles | 7 benchmarks
Entity Disambiguation
47 articles | 11 benchmarks
Grammatical Error Detection
47 articles | 4 benchmarks
Lexical Normalization
47 articles | 1 benchmarks
Lexical Simplification
47 articles | 0 benchmarks
Semantic Parsing
47 articles | 20 benchmarks
Text Categorization
47 articles | 0 benchmarks
Conversational Response Selection
46 articles | 15 benchmarks
Conversational Search
46 articles | 0 benchmarks
Dialogue Management
46 articles | 0 benchmarks
Document Summarization
46 articles | 7 benchmarks
Goal-Oriented Dialogue Systems
46 articles | 0 benchmarks
Hope Speech Detection
46 articles | 2 benchmarks
Benchmarking
45 articles | 2 benchmarks
Blocking
45 articles | 5 benchmarks
Dependency Parsing
45 articles | 15 benchmarks
Emotion-Cause Pair Extraction
45 articles | 2 benchmarks
Empathetic Response Generation
45 articles | 1 benchmarks
Extractive Text Summarization
45 articles | 5 benchmarks
Generative Question Answering
45 articles | 2 benchmarks
knowledge editing
45 articles | 1 benchmarks
Sentence Embeddings
45 articles | 0 benchmarks
Twitter Sentiment Analysis
45 articles | 0 benchmarks
Decipherment
44 articles | 0 benchmarks
GSM8K
44 articles | 1 benchmarks
Lexical Complexity Prediction
44 articles | 0 benchmarks
Morphological Tagging
44 articles | 0 benchmarks
TAR
44 articles | 0 benchmarks
Text Augmentation
44 articles | 0 benchmarks
Automated Essay Scoring
43 articles | 1 benchmarks
Chinese Word Segmentation
43 articles | 6 benchmarks
Novelty Detection
43 articles | 0 benchmarks
Prompt Engineering
43 articles | 16 benchmarks
Sentence Embedding
43 articles | 0 benchmarks
Sentence Summarization
42 articles | 0 benchmarks
Answer Generation
42 articles | 2 benchmarks
Arabic Sentiment Analysis
42 articles | 0 benchmarks
Cross-Lingual NER
42 articles | 28 benchmarks
Relation Classification
42 articles | 8 benchmarks
Spoken Language Understanding
42 articles | 5 benchmarks
Ad-Hoc Information Retrieval
41 articles | 1 benchmarks
Event Extraction
41 articles | 9 benchmarks
Learning with noisy labels
41 articles | 20 benchmarks
Named Entity Recognition (NER)
41 articles | 77 benchmarks
Reinforcement Learning
41 articles | 21 benchmarks
Safety Alignment
41 articles | 0 benchmarks
Aspect Extraction
40 articles | 6 benchmarks
Dialogue Evaluation
40 articles | 2 benchmarks
Hallucination Evaluation
40 articles | 0 benchmarks
Multimodal Sentiment Analysis
40 articles | 5 benchmarks
Continual Learning
39 articles | 32 benchmarks
Dialogue Generation
39 articles | 13 benchmarks
Distractor Generation
39 articles | 1 benchmarks
Intent Discovery
39 articles | 3 benchmarks
Knowledge Base Population
39 articles | 1 benchmarks
Script Generation
39 articles | 0 benchmarks
Semi-Supervised Text Classification
39 articles | 2 benchmarks
Sequential Pattern Mining
39 articles | 1 benchmarks
Sign Language Production
39 articles | 0 benchmarks
Spelling Correction
39 articles | 0 benchmarks
Text Infilling
39 articles | 0 benchmarks
Abstractive Text Summarization
38 articles | 18 benchmarks
Conversational Question Answering
38 articles | 1 benchmarks
coreference-resolution
38 articles | 0 benchmarks
Dialect Identification
38 articles | 0 benchmarks
Discourse Parsing
38 articles | 4 benchmarks
Discourse Segmentation
38 articles | 0 benchmarks
Document AI
38 articles | 1 benchmarks
Document Classification
38 articles | 21 benchmarks
Entity Alignment
38 articles | 10 benchmarks
Low Resource Named Entity Recognition
38 articles | 3 benchmarks
Self-Learning
38 articles | 0 benchmarks
Text Compression
38 articles | 0 benchmarks
Toxic Spans Detection
38 articles | 0 benchmarks
Emotion Recognition in Conversation
37 articles | 16 benchmarks
Implicit Discourse Relation Classification
37 articles | 0 benchmarks
Recipe Generation
37 articles | 5 benchmarks
Sentence-Pair Classification
37 articles | 0 benchmarks
Speech-to-Text Translation
37 articles | 10 benchmarks
Temporal Relation Extraction
37 articles | 1 benchmarks
Translation
37 articles | 7 benchmarks
Bias Detection
36 articles | 5 benchmarks
Hate Speech Detection
36 articles | 15 benchmarks
Headline Generation
36 articles | 1 benchmarks
Intent Classification
36 articles | 4 benchmarks
Intent Recognition
36 articles | 1 benchmarks
Language Modelling
36 articles | 55 benchmarks
Multilingual Named Entity Recognition
36 articles | 0 benchmarks
Multilingual NLP
36 articles | 0 benchmarks
Phrase Grounding
36 articles | 5 benchmarks
Question Generation
36 articles | 8 benchmarks
Attribute Value Extraction
35 articles | 4 benchmarks
Community Question Answering
35 articles | 2 benchmarks
Emotion Classification
35 articles | 9 benchmarks
Joint Entity and Relation Extraction
35 articles | 16 benchmarks
Query-focused Summarization
35 articles | 0 benchmarks
Text Style Transfer
35 articles | 2 benchmarks
NER
34 articles | 5 benchmarks
Column Type Annotation
34 articles | 12 benchmarks
Image Deblurring
34 articles | 9 benchmarks
Morphological Disambiguation
34 articles | 0 benchmarks
Open-Ended Question Answering
34 articles | 0 benchmarks
Question-Answer-Generation
34 articles | 0 benchmarks
Reverse Dictionary
34 articles | 0 benchmarks
Short Text Clustering
34 articles | 8 benchmarks
Temporal Information Extraction
34 articles | 2 benchmarks
Hypernym Discovery
33 articles | 3 benchmarks
Knowledge Base Question Answering
33 articles | 10 benchmarks
Language Identification
33 articles | 6 benchmarks
Long-range modeling
33 articles | 2 benchmarks
Low Resource NMT
33 articles | 0 benchmarks
Morphological Inflection
33 articles | 0 benchmarks
Native Language Identification
33 articles | 1 benchmarks
Code Repair
32 articles | 1 benchmarks
Document-level Event Extraction
32 articles | 1 benchmarks
Entity Resolution
32 articles | 11 benchmarks
Prepositional Phrase Attachment
32 articles | 0 benchmarks
Suggestion mining
32 articles | 0 benchmarks
Aspect Category Detection
31 articles | 4 benchmarks
Clickbait Detection
31 articles | 0 benchmarks
HellaSwag
31 articles | 0 benchmarks
Passage Re-Ranking
31 articles | 2 benchmarks
Table annotation
31 articles | 0 benchmarks
Cross-Lingual Natural Language Inference
30 articles | 4 benchmarks
Open-Domain Question Answering
30 articles | 15 benchmarks
Unsupervised Extractive Summarization
30 articles | 3 benchmarks
Word Sense Disambiguation
30 articles | 15 benchmarks
Dialogue Understanding
29 articles | 0 benchmarks
Keyphrase Generation
29 articles | 1 benchmarks
LAMBADA
29 articles | 1 benchmarks
Medical Named Entity Recognition
29 articles | 2 benchmarks
Natural Language Inference
29 articles | 37 benchmarks
Nested Named Entity Recognition
29 articles | 6 benchmarks
Part-Of-Speech Tagging
29 articles | 15 benchmarks
Reading Comprehension
29 articles | 7 benchmarks
Argument Mining
28 articles | 1 benchmarks
Coherence Evaluation
28 articles | 2 benchmarks
Implicatures
28 articles | 1 benchmarks
multimodal generation
28 articles | 1 benchmarks
News Generation
28 articles | 0 benchmarks
Stance Detection
28 articles | 22 benchmarks
Active Learning
27 articles | 1 benchmarks
Aggression Identification
27 articles | 0 benchmarks
Definition Extraction
27 articles | 0 benchmarks
Drug–drug Interaction Extraction
27 articles | 3 benchmarks
Emotion Cause Extraction
27 articles | 1 benchmarks
Entity Extraction using GAN
27 articles | 0 benchmarks
Entity Linking
27 articles | 27 benchmarks
Legal Reasoning
27 articles | 2 benchmarks
Mamba
27 articles | 0 benchmarks
Question Selection
27 articles | 1 benchmarks
Temporal Relation Classification
27 articles | 4 benchmarks
Toxic Comment Classification
27 articles | 4 benchmarks
Transliteration
27 articles | 0 benchmarks
Word Similarity
27 articles | 1 benchmarks
Decoder
26 articles | 0 benchmarks
Opinion Mining
26 articles | 1 benchmarks
Pretrained Multilingual Language Models
26 articles | 0 benchmarks
Question Rewriting
26 articles | 0 benchmarks
Table-based Fact Verification
26 articles | 1 benchmarks
Abstract Argumentation
25 articles | 0 benchmarks
Cross-Lingual Document Classification
25 articles | 10 benchmarks
Cross-Lingual Question Answering
25 articles | 3 benchmarks
Deep Learning
25 articles | 0 benchmarks
Diachronic Word Embeddings
25 articles | 0 benchmarks
Event Causality Identification
25 articles | 0 benchmarks
Low-Resource Neural Machine Translation
25 articles | 1 benchmarks
Protein Folding
25 articles | 0 benchmarks
Timeline Summarization
25 articles | 1 benchmarks
Automatic Post-Editing
24 articles | 0 benchmarks
CCG Supertagging
24 articles | 1 benchmarks
Coreference Resolution
24 articles | 16 benchmarks
Literature Mining
24 articles | 0 benchmarks
Method name prediction
24 articles | 1 benchmarks
Topic Models
24 articles | 6 benchmarks
Unsupervised Dependency Parsing
24 articles | 1 benchmarks
Chinese Named Entity Recognition
23 articles | 7 benchmarks
Emotional Intelligence
23 articles | 1 benchmarks
Few-Shot Relation Classification
23 articles | 4 benchmarks
Image to Video Generation
23 articles | 0 benchmarks
Semantic Retrieval
23 articles | 1 benchmarks
Taxonomy Expansion
23 articles | 0 benchmarks
Text-to-Image Generation
23 articles | 13 benchmarks
Text-To-SQL
23 articles | 10 benchmarks
Winogrande
23 articles | 0 benchmarks
Abuse Detection
22 articles | 0 benchmarks
Cross-Lingual Entity Linking
22 articles | 0 benchmarks
Data-free Knowledge Distillation
22 articles | 2 benchmarks
Dialog Act Classification
22 articles | 1 benchmarks
Extract Aspect
22 articles | 1 benchmarks
Extreme Summarization
22 articles | 4 benchmarks
Scientific Document Summarization
22 articles | 1 benchmarks
Short-Text Conversation
22 articles | 0 benchmarks
Table Retrieval
22 articles | 1 benchmarks
Text Retrieval
22 articles | 16 benchmarks
Word Translation
22 articles | 0 benchmarks
Cloze Test
21 articles | 2 benchmarks
Constituency Grammar Induction
21 articles | 1 benchmarks
Conversational Response Generation
21 articles | 0 benchmarks
Cross Document Coreference Resolution
21 articles | 0 benchmarks
KG-to-Text Generation
21 articles | 11 benchmarks
Large Language Model
21 articles | 2 benchmarks
Linguistic Acceptability
21 articles | 5 benchmarks
Opinion Summarization
21 articles | 0 benchmarks
Passage Ranking
21 articles | 1 benchmarks
Text Clustering
21 articles | 3 benchmarks
Zero-shot Slot Filling
21 articles | 3 benchmarks
Dependency Grammar Induction
20 articles | 2 benchmarks
Entity Typing
20 articles | 8 benchmarks
Intent Detection
20 articles | 19 benchmarks
Key Information Extraction
20 articles | 6 benchmarks
LLM-generated Text Detection
20 articles | 0 benchmarks
Paraphrase Identification
20 articles | 11 benchmarks
Probing Language Models
20 articles | 1 benchmarks
Specificity
20 articles | 0 benchmarks
Text Anonymization
20 articles | 0 benchmarks
Cross-Domain Named Entity Recognition
19 articles | 1 benchmarks
Dynamic Topic Modeling
19 articles | 0 benchmarks
Explanation Generation
19 articles | 5 benchmarks
Fine-Grained Opinion Analysis
19 articles | 1 benchmarks
Formality Style Transfer
19 articles | 1 benchmarks
Linguistic steganography
19 articles | 0 benchmarks
Low Resource Neural Machine Translation
19 articles | 0 benchmarks
Multi-Hop Reading Comprehension
19 articles | 0 benchmarks
Multi-Label Text Classification
19 articles | 20 benchmarks
News Classification
19 articles | 4 benchmarks
Relationship Extraction (Distant Supervised)
19 articles | 2 benchmarks
text annotation
19 articles | 0 benchmarks
Text-to-Video Generation
19 articles | 6 benchmarks
Toponym Resolution
19 articles | 0 benchmarks
XLM-R
19 articles | 0 benchmarks
Aspect Category Sentiment Analysis
18 articles | 1 benchmarks
Component Classification
18 articles | 1 benchmarks
Data-to-Text Generation
18 articles | 26 benchmarks
Event Relation Extraction
18 articles | 0 benchmarks
Language Acquisition
18 articles | 1 benchmarks
Story Generation
18 articles | 5 benchmarks
Answer Selection
17 articles | 6 benchmarks
Chinese Spell Checking
17 articles | 1 benchmarks
Complex Word Identification
17 articles | 0 benchmarks
Concept-To-Text Generation
17 articles | 1 benchmarks
De-identification
17 articles | 0 benchmarks
Gender Bias Detection
17 articles | 0 benchmarks
Memorization
17 articles | 1 benchmarks
nlg evaluation
17 articles | 0 benchmarks
POS Tagging
17 articles | 2 benchmarks
Semantic Role Labeling
17 articles | 7 benchmarks
Topic coverage
17 articles | 3 benchmarks
Vietnamese Datasets
17 articles | 0 benchmarks
Visual Dialog
17 articles | 8 benchmarks
Zero-Shot Stance Detection
17 articles | 0 benchmarks
AMR Parsing
16 articles | 8 benchmarks
Citation Intent Classification
16 articles | 2 benchmarks
Conditional Text Generation
16 articles | 1 benchmarks
Cross-Lingual Information Retrieval
16 articles | 0 benchmarks
Embeddings Evaluation
16 articles | 0 benchmarks
Fake News Detection
16 articles | 10 benchmarks
Keyword Extraction
16 articles | 3 benchmarks
Relational Reasoning
16 articles | 1 benchmarks
Semantic Textual Similarity
16 articles | 13 benchmarks
Story Completion
16 articles | 0 benchmarks
Table-to-Text Generation
16 articles | 8 benchmarks
Text Summarization
16 articles | 37 benchmarks
Transition-Based Dependency Parsing
16 articles | 0 benchmarks
Zero-Shot Text-to-Image Generation
16 articles | 0 benchmarks
Abstract Meaning Representation
15 articles | 0 benchmarks
Action Parsing
15 articles | 1 benchmarks
Aspect-Based Sentiment Analysis (ABSA)
15 articles | 18 benchmarks
Authorship Verification
15 articles | 0 benchmarks
Continual Relation Extraction
15 articles | 0 benchmarks
Dialogue Act Classification
15 articles | 5 benchmarks
Language Modeling
15 articles | 0 benchmarks
Machine Translation
15 articles | 83 benchmarks
PICO
15 articles | 1 benchmarks
Polyphone disambiguation
15 articles | 1 benchmarks
Prosody Prediction
15 articles | 1 benchmarks
Question Answering
15 articles | 149 benchmarks
Temporal Tagging
15 articles | 8 benchmarks
Aspect Term Extraction and Sentiment Classification
14 articles | 1 benchmarks
Cross-Domain Text Classification
14 articles | 0 benchmarks
Dialog Relation Extraction
14 articles | 2 benchmarks
Fact Selection
14 articles | 1 benchmarks
Implicit Relations
14 articles | 1 benchmarks
Key Point Matching
14 articles | 0 benchmarks
Profile Generation
14 articles | 1 benchmarks
Semantic entity labeling
14 articles | 2 benchmarks
Spam detection
14 articles | 1 benchmarks
Table-based Question Answering
14 articles | 0 benchmarks
Table Search
14 articles | 0 benchmarks
Text Generation
14 articles | 71 benchmarks
Automated Writing Evaluation
13 articles | 0 benchmarks
Cell Entity Annotation
13 articles | 5 benchmarks
Comment Generation
13 articles | 0 benchmarks
Commonsense Causal Reasoning
13 articles | 0 benchmarks
DRS Parsing
13 articles | 2 benchmarks
Extractive Summarization
13 articles | 0 benchmarks
Few-shot NER
13 articles | 4 benchmarks
Long-Context Understanding
13 articles | 5 benchmarks
Model Editing
13 articles | 0 benchmarks
Parallel Corpus Mining
13 articles | 0 benchmarks
Persian Sentiment Analysis
13 articles | 0 benchmarks
RAG
13 articles | 0 benchmarks
Text Classification
13 articles | 85 benchmarks
UCCA Parsing
13 articles | 2 benchmarks
Arabic Text Diacritization
12 articles | 2 benchmarks
Causal Emotion Entailment
12 articles | 1 benchmarks
Conversation Disentanglement
12 articles | 3 benchmarks
Humor Detection
12 articles | 1 benchmarks
Key-value Pair Extraction
12 articles | 2 benchmarks
Negation Scope Resolution
12 articles | 4 benchmarks
Predicate Detection
12 articles | 3 benchmarks
Relevance Detection
12 articles | 0 benchmarks
Sentence Pair Modeling
12 articles | 0 benchmarks
Session Search
12 articles | 0 benchmarks
Simultaneous Speech-to-Text Translation
12 articles | 0 benchmarks
Unsupervised Text Classification
12 articles | 4 benchmarks
Author Attribution
11 articles | 0 benchmarks
Columns Property Annotation
11 articles | 4 benchmarks
End-To-End Dialogue Modelling
11 articles | 2 benchmarks
Hint Generation
11 articles | 0 benchmarks
Mathematical Question Answering
11 articles | 2 benchmarks
Multiple Choice Question Answering (MCQA)
11 articles | 31 benchmarks
Nested Mention Recognition
11 articles | 2 benchmarks
Paper generation
11 articles | 2 benchmarks
Passage Retrieval
11 articles | 6 benchmarks
Question Similarity
11 articles | 1 benchmarks
Satire Detection
11 articles | 0 benchmarks
Subjectivity Analysis
11 articles | 2 benchmarks
Toponym Recognition
11 articles | 0 benchmarks
Vietnamese Word Segmentation
11 articles | 0 benchmarks
Zero-Shot Cross-Lingual Transfer
11 articles | 2 benchmarks
Zero-shot Named Entity Recognition (NER)
11 articles | 4 benchmarks
Abusive Language
10 articles | 0 benchmarks
Chunking
10 articles | 5 benchmarks
Cross-Lingual Semantic Textual Similarity
10 articles | 0 benchmarks
Document Ranking
10 articles | 2 benchmarks
Lay Summarization
10 articles | 2 benchmarks
Multi-modal Named Entity Recognition
10 articles | 5 benchmarks
Natural Language Understanding
10 articles | 6 benchmarks
Open-Domain Dialog
10 articles | 1 benchmarks
Semantic Composition
10 articles | 0 benchmarks
Semantic Shift Detection
10 articles | 0 benchmarks
Simultaneous Speech-to-Speech Translation
10 articles | 0 benchmarks
Only Connect Walls Dataset Task 1 (Grouping)
10 articles | 1 benchmarks
Text Simplification
10 articles | 11 benchmarks
Variable Detection
10 articles | 1 benchmarks
Zero-shot Event Extraction
10 articles | 0 benchmarks
AI Agent
9 articles | 0 benchmarks
answerability prediction
9 articles | 1 benchmarks
Binary Relation Extraction
9 articles | 2 benchmarks
Bridging Anaphora Resolution
9 articles | 0 benchmarks
Chinese Zero Pronoun Resolution
9 articles | 0 benchmarks
Connective Detection
9 articles | 0 benchmarks
Document Dating
9 articles | 2 benchmarks
Image-guided Story Ending Generation
9 articles | 2 benchmarks
molecular representation
9 articles | 0 benchmarks
Response Generation
9 articles | 3 benchmarks
Sentiment Analysis
9 articles | 42 benchmarks
Unsupervised Opinion Summarization
9 articles | 3 benchmarks
Vietnamese Social Media Text Processing
9 articles | 0 benchmarks
Author Profiling
8 articles | 0 benchmarks
Belebele
8 articles | 0 benchmarks
Bilingual Lexicon Induction
8 articles | 0 benchmarks
Cross-Lingual Word Embeddings
8 articles | 0 benchmarks
Definition Modelling
8 articles | 0 benchmarks
Dialog Learning
8 articles | 0 benchmarks
Emotion Recognition in Context
8 articles | 4 benchmarks
Grammatical Error Correction
8 articles | 13 benchmarks
Handwritten Chinese Text Recognition
8 articles | 0 benchmarks
Multi-agent Integration
8 articles | 1 benchmarks
Offline Handwritten Chinese Character Recognition
8 articles | 0 benchmarks
Paraphrase Generation
8 articles | 3 benchmarks
Sarcasm Detection
8 articles | 9 benchmarks
Spatial Reasoning
8 articles | 2 benchmarks
Summarization
8 articles | 12 benchmarks
target-oriented opinion words extraction
8 articles | 0 benchmarks
Thai Word Segmentation
8 articles | 2 benchmarks
Unsupervised Sentence Summarization
8 articles | 0 benchmarks
User Simulation
8 articles | 0 benchmarks
Vietnamese Hate Speech Detection
8 articles | 0 benchmarks
WNLI
8 articles | 0 benchmarks
Zero-Shot Machine Translation
8 articles | 0 benchmarks
Aspect-oriented Opinion Extraction
7 articles | 1 benchmarks
Code Documentation Generation
7 articles | 7 benchmarks
Contextualised Word Representations
7 articles | 0 benchmarks
Dialogue Rewriting
7 articles | 3 benchmarks
Few-Shot Stance Detection
7 articles | 0 benchmarks
Image Segmentation
7 articles | 12 benchmarks
Japanese Word Segmentation
7 articles | 1 benchmarks
Meme Classification
7 articles | 3 benchmarks
Occupation prediction
7 articles | 0 benchmarks
Open Intent Discovery
7 articles | 6 benchmarks
Privacy Preserving Deep Learning
7 articles | 0 benchmarks
Propaganda detection
7 articles | 0 benchmarks
Propaganda span identification
7 articles | 0 benchmarks
Query-Based Extractive Summarization
7 articles | 1 benchmarks
Slot Filling
7 articles | 14 benchmarks
SNARKS
7 articles | 0 benchmarks
Text Attribute Transfer
7 articles | 0 benchmarks
Timex normalization
7 articles | 2 benchmarks
Vietnamese Visual Question Answering
7 articles | 0 benchmarks
Word Sense Induction
7 articles | 1 benchmarks
Aspect-Category-Opinion-Sentiment Quadruple Extraction
6 articles | 2 benchmarks
Aspect Category Polarity
6 articles | 1 benchmarks
Cognate Prediction
6 articles | 0 benchmarks
Cross-Lingual Bitext Mining
6 articles | 4 benchmarks
Deep Attention
6 articles | 0 benchmarks
Equation Discovery
6 articles | 0 benchmarks
Fact Verification
6 articles | 3 benchmarks
Grounded language learning
6 articles | 0 benchmarks
Information Retrieval
6 articles | 34 benchmarks
Math Word Problem Solving
6 articles | 13 benchmarks
Mathematical Reasoning
6 articles | 11 benchmarks
Morpheme Segmentaiton
6 articles | 1 benchmarks
News Annotation
6 articles | 0 benchmarks
Open Intent Detection
6 articles | 17 benchmarks
Selection bias
6 articles | 0 benchmarks
Syntax Representation
6 articles | 0 benchmarks
Task-Completion Dialogue Policy Learning
6 articles | 0 benchmarks
Temporal/Casual QA
6 articles | 1 benchmarks
Term Extraction
6 articles | 2 benchmarks
text-to-Cypher
6 articles | 0 benchmarks
Vietnamese Language Models
6 articles | 0 benchmarks
Zero-shot Sentiment Classification
6 articles | 1 benchmarks
Argument Pair Extraction (APE)
5 articles | 1 benchmarks
Binary Condescension Detection
5 articles | 1 benchmarks
Continual Named Entity Recognition
5 articles | 0 benchmarks
Cross-Lingual Transfer
5 articles | 1 benchmarks
Dialogue Interpretation
5 articles | 0 benchmarks
Drug Design
5 articles | 0 benchmarks
DrugProt
5 articles | 1 benchmarks
Job classification
5 articles | 0 benchmarks
Job Prediction
5 articles | 0 benchmarks
Lexical Analysis
5 articles | 0 benchmarks
Long Form Question Answering
5 articles | 0 benchmarks
Multi-label Condescension Detection
5 articles | 1 benchmarks
Multimodal Machine Translation
5 articles | 3 benchmarks
Named Entity Recognition In Vietnamese
5 articles | 2 benchmarks
Personality Alignment
5 articles | 0 benchmarks
Reading Order Detection
5 articles | 2 benchmarks
Riddle Sense
5 articles | 2 benchmarks
Scientific Results Extraction
5 articles | 2 benchmarks
Stereotypical Bias Analysis
5 articles | 1 benchmarks
Text Effects Transfer
5 articles | 0 benchmarks
Unsupervised Part-Of-Speech Tagging
5 articles | 0 benchmarks
Vietnamese Image Captioning
5 articles | 0 benchmarks
Zero-shot Relation Triplet Extraction
5 articles | 2 benchmarks
Abstract Anaphora Resolution
4 articles | 1 benchmarks
Attribute Mining
4 articles | 3 benchmarks
Authorship Attribution
4 articles | 0 benchmarks
Bangla Spelling Error Correction
4 articles | 1 benchmarks
Chemical Indexing
4 articles | 1 benchmarks
Class-level Code Generation
4 articles | 1 benchmarks
Cross-lingual zero-shot dependency parsing
4 articles | 1 benchmarks
Chinese Spelling Error Correction
4 articles | 0 benchmarks
Document-level Relation Extraction
4 articles | 3 benchmarks
Emotional Dialogue Acts
4 articles | 0 benchmarks
Empirical Judgments
4 articles | 1 benchmarks
Extracting COVID-19 Events from Twitter
4 articles | 1 benchmarks
Face Selection
4 articles | 0 benchmarks
Goal-Oriented Dialog
4 articles | 1 benchmarks
Hope Speech Detection for Tamil
4 articles | 1 benchmarks
Information Threading
4 articles | 2 benchmarks
Instruction Following
4 articles | 1 benchmarks
Interactive Evaluation of Dialog
4 articles | 1 benchmarks
Joint Multilingual Sentence Representations
4 articles | 0 benchmarks
Logical Reasoning Question Answering
4 articles | 1 benchmarks
Logical Reasoning Reading Comprehension
4 articles | 0 benchmarks
Misogynistic Aggression Identification
4 articles | 0 benchmarks
Multimodal Attribute Value Extraction
4 articles | 0 benchmarks
Open Information Extraction
4 articles | 13 benchmarks
Page Stream Segmentation
4 articles | 0 benchmarks
Personality Generation
4 articles | 0 benchmarks
Reliable Intelligence Identification
4 articles | 0 benchmarks
Semantic Role Labeling (predicted predicates)
4 articles | 2 benchmarks
Speculation Detection
4 articles | 0 benchmarks
Text-Based Stock Prediction
4 articles | 0 benchmarks
Text-to-video search
4 articles | 0 benchmarks
Timedial
4 articles | 1 benchmarks
Twitter Event Detection
4 articles | 1 benchmarks
Unsupervised Sentence Compression
4 articles | 0 benchmarks
Unsupervised semantic parsing
4 articles | 2 benchmarks
Vietnamese Fact Checking
4 articles | 0 benchmarks
Vietnamese Speech Recognition
4 articles | 0 benchmarks
AI and Safety
3 articles | 0 benchmarks
Aspect Category Sentiment Classification
3 articles | 0 benchmarks
Aspect-Sentiment-Opinion Triplet Extraction
3 articles | 1 benchmarks
Constituency Parsing
3 articles | 4 benchmarks
Conversational Web Navigation
3 articles | 1 benchmarks
Dark Humor Detection
3 articles | 1 benchmarks
Data Mining
3 articles | 0 benchmarks
Dialogue Safety Prediction
3 articles | 2 benchmarks
Disambiguation QA
3 articles | 0 benchmarks
Discourse Marker Prediction
3 articles | 1 benchmarks
Domain Labelling
3 articles | 1 benchmarks
End-to-End RST Parsing
3 articles | 1 benchmarks
English Proverbs
3 articles | 1 benchmarks
Extract aspect-polarity tuple
3 articles | 1 benchmarks
Few-shot HTC
3 articles | 0 benchmarks
Formal Fallacies Syllogisms Negation
3 articles | 0 benchmarks
Hate Speech Normalization
3 articles | 0 benchmarks
Hyperbaton
3 articles | 0 benchmarks
image-sentence alignment
3 articles | 12 benchmarks
Information Extraction
3 articles | 1 benchmarks
KB-to-Language Generation
3 articles | 1 benchmarks
Meme Captioning
3 articles | 0 benchmarks
Memex Question Answering
3 articles | 1 benchmarks
Multi-modal Dialogue Generation
3 articles | 1 benchmarks
Negation Detection
3 articles | 0 benchmarks
Personality Recognition in Conversation
3 articles | 1 benchmarks
Phrase Ranking
3 articles | 2 benchmarks
Phrase Relatedness
3 articles | 1 benchmarks
Phrase Tagging
3 articles | 2 benchmarks
Political Salient Issue Orientation Detection
3 articles | 1 benchmarks
Poll Generation
3 articles | 1 benchmarks
Recognizing Emotion Cause in Conversations
3 articles | 2 benchmarks
Record linking
3 articles | 0 benchmarks
Relational Captioning
3 articles | 1 benchmarks
Ruin Names
3 articles | 0 benchmarks
Sentence Classification
3 articles | 6 benchmarks
Sentence Embeddings For Biomedical Texts
3 articles | 2 benchmarks
Social Media Mental Health Detection
3 articles | 0 benchmarks
Sonnet Generation
3 articles | 0 benchmarks
Speculation Scope Resolution
3 articles | 3 benchmarks
Turning Point Identification
3 articles | 0 benchmarks
Vietnamese Aspect-Based Sentiment Analysis
3 articles | 0 benchmarks
Vietnamese Natural Language Understanding
3 articles | 0 benchmarks
Vietnamese Scene Text
3 articles | 0 benchmarks
Vietnamese Sentiment Analysis
3 articles | 0 benchmarks
4-ary Relation Extraction
2 articles | 1 benchmarks
ArabicMMLU
2 articles | 0 benchmarks
Automatic Writing
2 articles | 0 benchmarks
Claim-Evidence Pair Extraction (CEPE)
2 articles | 1 benchmarks
Claim Extraction with Stance Classification (CESC)
2 articles | 1 benchmarks
Clinical Information Retreival
2 articles | 0 benchmarks
Clinical Language Translation
2 articles | 0 benchmarks
Clinical Section Identification
2 articles | 1 benchmarks
Collaborative Plan Acquisition
2 articles | 0 benchmarks
Context Query Reformulation
2 articles | 0 benchmarks
Croatian Text Diacritization
2 articles | 1 benchmarks
Cross-lingual Text-to-Image Generation
2 articles | 0 benchmarks
Czech Text Diacritization
2 articles | 1 benchmarks
Description-guided molecule generation
2 articles | 1 benchmarks
Document-level Closed Information Extraction
2 articles | 3 benchmarks
Document-level RE with incomplete labeling
2 articles | 2 benchmarks
Email Thread Summarization
2 articles | 2 benchmarks
Event-Driven Trading
2 articles | 0 benchmarks
Fantasy Reasoning
2 articles | 1 benchmarks
few-shot-htc
2 articles | 0 benchmarks
Figure Of Speech Detection
2 articles | 1 benchmarks
French Text Diacritization
2 articles | 1 benchmarks
GRE Reading Comprehension
2 articles | 1 benchmarks
Hate Span Identification
2 articles | 0 benchmarks
Hidden Aspect Detection
2 articles | 0 benchmarks
Hierarchical Text Classification of Blurbs (GermEval 2019)
2 articles | 1 benchmarks
Hierarchical Text Clustering
2 articles | 0 benchmarks
Hope Speech Detection for English
2 articles | 1 benchmarks
Hope Speech Detection for Malayalam
2 articles | 1 benchmarks
Hungarian Text Diacritization
2 articles | 1 benchmarks
Hyper-Relational Extraction
2 articles | 1 benchmarks
Image-to-Text Retrieval
2 articles | 8 benchmarks
incongruity detection
2 articles | 0 benchmarks
Intrusion Detection
2 articles | 5 benchmarks
Irish Text Diacritization
2 articles | 1 benchmarks
Irony Identification
2 articles | 1 benchmarks
Keyphrase Extraction
2 articles | 6 benchmarks
Latvian Text Diacritization
2 articles | 1 benchmarks
legal outcome extraction
2 articles | 0 benchmarks
Machine Reading Comprehension
2 articles | 4 benchmarks
Math Information Retrieval
2 articles | 1 benchmarks
Molecular description generation
2 articles | 0 benchmarks
Movie Dialog Same Or Different
2 articles | 1 benchmarks
Multi-Document Summarization
2 articles | 5 benchmarks
Multi-lingual Text-to-Image Generation
2 articles | 0 benchmarks
multilingual cross-modal retrieval
2 articles | 0 benchmarks
Multilingual Paraphrase Generation
2 articles | 0 benchmarks
Multimodal Abstractive Text Summarization
2 articles | 1 benchmarks
Multimodal Lexical Translation
2 articles | 4 benchmarks
Natural Language Transduction
2 articles | 0 benchmarks
Negation and Speculation Cue Detection
2 articles | 2 benchmarks
Negation and Speculation Scope resolution
2 articles | 0 benchmarks
Nonsense Words Grammar
2 articles | 1 benchmarks
Open Relation Modeling
2 articles | 0 benchmarks
Personalized and Emotional Conversation
2 articles | 1 benchmarks
Political evalutation
2 articles | 0 benchmarks
RACE-h
2 articles | 1 benchmarks
RACE-m
2 articles | 1 benchmarks
Reader-Aware Summarization
2 articles | 1 benchmarks
Role-filler Entity Extraction
2 articles | 1 benchmarks
Romanian Text Diacritization
2 articles | 1 benchmarks
Scientific Concept Extraction
2 articles | 1 benchmarks
Semantic Similarity
2 articles | 26 benchmarks
SemEval-2022 Task 4-1 (Binary PCL Detection)
2 articles | 1 benchmarks
SemEval-2022 Task 4-2 (Multi-label PCL Detection)
2 articles | 1 benchmarks
Semi-Supervised Text Regression
2 articles | 0 benchmarks
Sensitivity Classification
2 articles | 1 benchmarks
Sentiment Dependency Learning
2 articles | 0 benchmarks
Sketch-to-text Generation
2 articles | 0 benchmarks
Slovak Text Diacritization
2 articles | 1 benchmarks
Spanish Text Diacritization
2 articles | 1 benchmarks
SSTOD
2 articles | 2 benchmarks
Task-Oriented Dialogue Systems
2 articles | 4 benchmarks
Text Matching
2 articles | 0 benchmarks
Text-to-GQL
2 articles | 0 benchmarks
Text-Variation
2 articles | 0 benchmarks
Textual Analogy Parsing
2 articles | 0 benchmarks
True or False Question Answering
2 articles | 0 benchmarks
trustable and focussed LLM generated content
2 articles | 0 benchmarks
Turkish Text Diacritization
2 articles | 1 benchmarks
Understanding Fables
2 articles | 1 benchmarks
Unsupervised KG-to-Text Generation
2 articles | 4 benchmarks
Unsupervised Machine Translation
2 articles | 9 benchmarks
ValNov
2 articles | 2 benchmarks
Vietnamese Parsing
2 articles | 0 benchmarks
Vietnamese Text Diacritization
2 articles | 1 benchmarks
Visual Commonsense Tests
2 articles | 1 benchmarks
Workflow Discovery
2 articles | 1 benchmarks
Alignement visualisation
1 articles | 0 benchmarks
Anaphora Resolution
1 articles | 0 benchmarks
ARQMath2
1 articles | 0 benchmarks
Aspect Sentiment Triplet Extraction
1 articles | 4 benchmarks
Bangla Text Detection
1 articles | 1 benchmarks
Blackout Poetry Generation
1 articles | 1 benchmarks
Catalog Extraction
1 articles | 1 benchmarks
Cause-Effect Relation Classification
1 articles | 0 benchmarks
Chinese
1 articles | 0 benchmarks
Clinical Assertion Status Detection
1 articles | 1 benchmarks
Coding Problem Tagging
1 articles | 0 benchmarks
Commonsense Reasoning for RL
1 articles | 1 benchmarks
Complaint Comment Classification
1 articles | 0 benchmarks
Context-specific Spam Detection
1 articles | 1 benchmarks
Contextualized Literature-based Discovery
1 articles | 0 benchmarks
Controllable Language Modelling
1 articles | 0 benchmarks
Conversational Sentiment Quadruple Extraction
1 articles | 2 benchmarks
Counterspeech Detection
1 articles | 1 benchmarks
Cross-Document Language Modeling
1 articles | 2 benchmarks
Cross-Language Text Summarization
1 articles | 0 benchmarks
Cross-Lingual
1 articles | 0 benchmarks
Crowdsourced Text Aggregation
1 articles | 2 benchmarks
Detection of potentially void clauses
1 articles | 1 benchmarks
Dialogue
1 articles | 1 benchmarks
Direct NMT
1 articles | 0 benchmarks
Emergent communications on relations
1 articles | 0 benchmarks
Emotion Detection and Trigger Summarization
1 articles | 0 benchmarks
Entity Typing on DH-KGs
1 articles | 0 benchmarks
Extractive Tags Summarization
1 articles | 0 benchmarks
Fact-based Text Editing
1 articles | 2 benchmarks
FG-1-PG-1
1 articles | 3 benchmarks
Figurative Language Visualization
1 articles | 0 benchmarks
Genetic IE
1 articles | 0 benchmarks
GermEval2024 Shared Task 1 Subtask 1
1 articles | 1 benchmarks
GermEval2024 Shared Task 1 Subtask 2
1 articles | 1 benchmarks
Grapheme Detection
1 articles | 0 benchmarks
Grounded Open Vocabulary Acquisition
1 articles | 0 benchmarks
Hate Intensity Prediction
1 articles | 0 benchmarks
Hate Speech Detection CrisisHateMM Benchmark
1 articles | 0 benchmarks
Hurtful Sentence Completion
1 articles | 1 benchmarks
Joint Entity and Relation Extraction on Scientific Data
1 articles | 0 benchmarks
Joint NER and Classification
1 articles | 0 benchmarks
Latent Aspect Detection
1 articles | 0 benchmarks
Legal Document Translation
1 articles | 0 benchmarks
Line Items Extraction
1 articles | 0 benchmarks
Link prediction on DH-KGs
1 articles | 1 benchmarks
Medical question pair similarity computation
1 articles | 0 benchmarks
Meeting Summarization
1 articles | 2 benchmarks
Metric-Type Identification
1 articles | 0 benchmarks
MMSQL performance
1 articles | 1 benchmarks
Morphological Analysis
1 articles | 0 benchmarks
Multi-Dialect Vietnamese
1 articles | 0 benchmarks
Multi-Grained Named Entity Recognition
1 articles | 0 benchmarks
Multi-Labeled Relation Extraction
1 articles | 0 benchmarks
multi-word expression embedding
1 articles | 0 benchmarks
multi-word expression sememe prediction
1 articles | 0 benchmarks
Multilingual Machine Comprehension in English Hindi
1 articles | 1 benchmarks
Multimedia Generative Script Learning
1 articles | 0 benchmarks
Multimodal GIF Dialog
1 articles | 1 benchmarks
Multimodal Text Prediction
1 articles | 1 benchmarks
Multiview Contextual Commonsense Inference
1 articles | 2 benchmarks
Multlingual Neural Machine Translation
1 articles | 0 benchmarks
Natural Language Landmark Navigation Instructions Generation
1 articles | 1 benchmarks
Open-World Social Event Classification
1 articles | 0 benchmarks
Overlapping Mention Recognition
1 articles | 0 benchmarks
Pcl Detection
1 articles | 0 benchmarks
Persona Dialogue in Story
1 articles | 1 benchmarks
Phrase Vector Embedding
1 articles | 0 benchmarks
Poem meters classification
1 articles | 1 benchmarks
Problem-Solving Deliberation
1 articles | 1 benchmarks
Pronunciation Dictionary Creation
1 articles | 0 benchmarks
Propaganda technique identification
1 articles | 0 benchmarks
quantum circuit classification (classical ML)
1 articles | 0 benchmarks
Query Wellformedness
1 articles | 1 benchmarks
Question-Answer categorization
1 articles | 1 benchmarks
Question Quality Assessment
1 articles | 2 benchmarks
Question to Declarative Sentence
1 articles | 0 benchmarks
Readability optimization
1 articles | 0 benchmarks
relation explanation
1 articles | 0 benchmarks
Relation Mention Extraction
1 articles | 0 benchmarks
Row Annotation
1 articles | 1 benchmarks
Rules-of-thumb Generation
1 articles | 0 benchmarks
Semi-Supervised Formality Style Transfer
1 articles | 0 benchmarks
Speaker Attribution in German Parliamentary Debates (GermEval 2023, subtask 1)
1 articles | 1 benchmarks
Stance Detection (US Election 2020 - Biden)
1 articles | 1 benchmarks
Stance Detection (US Election 2020 - Trump)
1 articles | 1 benchmarks
Summarization Consistency Evaluation
1 articles | 1 benchmarks
Table Type Detection
1 articles | 1 benchmarks
Only Connect Walls Dataset Task 2 (Connections)
1 articles | 0 benchmarks
Taxonomy Learning
1 articles | 0 benchmarks
Text-to-CQL
1 articles | 0 benchmarks
Traditional Spam Detection
1 articles | 1 benchmarks
Tweet-Reply Sentiment Analysis
1 articles | 1 benchmarks
Variable Disambiguation
1 articles | 1 benchmarks
Vietnamese Lexical Normalization
1 articles | 0 benchmarks
Vietnamese Multimodal Sentiment Analysis
1 articles | 0 benchmarks
Visual Storytelling
1 articles | 1 benchmarks
Weakly Supervised Data Denoising
1 articles | 0 benchmarks
Web Page Tagging
1 articles | 0 benchmarks
Word Attribute Transfer
1 articles | 0 benchmarks
Zero-Shot Out-of-Domain Detection
1 articles | 0 benchmarks