HyperAI
HyperAI
Home
Console
Docs
News
Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
Search the site…
⌘
K
Command Palette
Search for a command to run...
Console
Home
SOTA
Visual Storytelling
Visual Storytelling On Vist
Visual Storytelling On Vist
Metrics
BLEU-4
CIDEr
METEOR
ROUGE-L
Results
Performance results of various models on this benchmark
Columns
Model Name
BLEU-4
CIDEr
METEOR
ROUGE-L
Paper Title
HEGR
16.7
14.1
37.8
-
Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination
TAPM
-
13.8
37.2
33.1
Transitional Adaptation of Pretrained Models for Visual Storytelling
HBSG
15.4
-
36.5
-
Visual Storytelling with Hierarchical BERT Semantic Guidance
CoVS
15.2
11.5
36.5
30.8
Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention
MCSM+RNN
13
11
36.1
30.7
Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
AOG + ARS
12.9
12.0
36.0
30.1
AOG-LSTM: An adaptive attention neural network for visual storytelling
SGVST
14.7
9.8
35.8
29.9
Storytelling from an Image Stream Using Scene Graphs
TAVST (RL)
14.6
9.2
35.7
31
Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication
SentiStory
14.8
10.1
35.7
30.2
SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling
INet
14.7
10
35.6
29.7
Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling
SGEmb
14.8
8.6
35.6
30.2
Diverse and Relevant Visual Storytelling with Scene Graph Embeddings
IRW
15.4
11.0
35.6
29.6
Imagine, Reason and Write: Visual Storytelling with Graph Knowledge and Relational Reasoning
MemNet
14.1
-
35.5
-
Hierarchical memory decoder for visual narrating
StoryAnchor: w/ Predicted Nouns
14
9.9
35.5
30
Visual Storytelling via Predicting Anchor Word Embeddings in the Stories
VSCMR
14.3
9
35.5
30.2
Informative Visual Storytelling with Cross-modal Rules
ViT-model
12.3
4.4
35.4
31
Vision Transformer Based Model for Describing a Set of Images as a Story
HSRL w/ Joint Training
12.32
10.71
35.23
30.84
Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation
K-Storyteller
12.8
12.1
35.2
29.9
Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling
BLEU-RL
14.4
6.7
35.2
30.1
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
AREL
13.6
9.1
35.2
29.3
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
0 of 33 row(s) selected.
Previous
Next