Word-level CNN+LSTM (partial scoring) | 53.3 | A Simple Method for Commonsense Reasoning | |
USSM + Supervised Deepnet | 53.3 | Attention Is (not) All You Need for Commonsense Reasoning | |
USSM + Cause-Effect Knowledge Base | 55.0 | Probabilistic Reasoning via Deep Learning: Neural Association Models | |
Subword-level Transformer LM | 58.3 | Attention Is All You Need | |
Word-level CNN+LSTM (full scoring) | 60.0 | A Simple Method for Commonsense Reasoning | |
USSM + Supervised Deepnet + 3 Knowledge Bases | 66.7 | Attention Is (not) All You Need for Commonsense Reasoning | |