Sentiment Classification Using Document Embeddings Trained with Cosine Similarity

Tan Thongtan, Tanasanee Phienthrakul

Abstract

In document-level sentiment classification, each document must be mapped to a fixed-length vector. Document embedding models map each document to a dense, low-dimensional vector in a continuous vector space. This paper proposes training document embeddings using cosine similarity instead of dot product. Experiments on the IMDB dataset show that cosine similarity yields higher accuracy than dot product, and that combining the resulting embeddings with Naive Bayes weighted bag-of-n-grams features achieves a new state-of-the-art accuracy of 97.42%. Code to reproduce all experiments is available at https://github.com/tanthongtan/dv-cosine
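
To illustrate the distinction the abstract draws, the following is a minimal sketch of the two similarity scores between a document vector and an n-gram vector. It is not the authors' implementation (see the repository linked above for that); the function names `dot_score` and `cosine_score`, the `eps` guard, and the toy vectors are assumptions made purely for illustration.

```python
import numpy as np


def dot_score(doc_vec: np.ndarray, ngram_vec: np.ndarray) -> float:
    """Unnormalized similarity: grows with the magnitude of either vector."""
    return float(np.dot(doc_vec, ngram_vec))


def cosine_score(doc_vec: np.ndarray, ngram_vec: np.ndarray, eps: float = 1e-12) -> float:
    """Dot product divided by the product of the L2 norms; bounded in [-1, 1].

    `eps` is a hypothetical guard against division by zero for all-zero vectors.
    """
    denom = np.linalg.norm(doc_vec) * np.linalg.norm(ngram_vec)
    return float(np.dot(doc_vec, ngram_vec) / max(denom, eps))


# Toy vectors (illustrative only, not taken from the paper).
doc = np.array([1.0, 2.0, 2.0])
ngram = np.array([2.0, 4.0, 4.0])

# Scaling the document vector changes the dot product but not the cosine.
print(dot_score(doc, ngram), dot_score(10 * doc, ngram))
print(cosine_score(doc, ngram), cosine_score(10 * doc, ngram))
```

Because the cosine score depends only on the angle between the vectors, it stays bounded however large the embedding norms become during training, whereas the dot product scales with them.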

