HyperAI

MIND Microsoft News Dataset

Date

9 months ago

Size

1.15 GB

Organization

Microsoft
Tsinghua University

Publish URL

msnews.github.io

Tags

Categories

MIcrosoft News Dataset (MIND) is a large dataset for news recommendation research, which is collected from anonymous behavior logs of the Microsoft News website. The mission of MIND is to serve as a benchmark dataset for news recommendation and to promote research in the field of news recommendation and recommendation systems.

MIND contains about 160,000 English news articles and more than 15 million impression logs generated by 1 million users. Each news article contains rich text content, including title, summary, body, categories, and entities. Each impression log contains the user's click events, non-click events, and historical news click behaviors before this impression. To protect user privacy, each user is decoupled from the production system when securely hashed into an anonymous ID.

This dataset was released by Microsoft and Tsinghua University in 2020. The relevant paper is “MIND: A Large-scale Dataset for News Recommendation", selected for ACL 2020.

MIcrosoftNewsDataset-MIND.torrent
Seeding 1Downloading 1Completed 92Total Downloads 150
  • MIcrosoftNewsDataset-MIND/
    • README.md
      1.54 KB
    • README.txt
      3.08 KB
      • data/
        • mind.zip
          1.15 GB