HyperAIHyperAI

SMolInstruct Chemical Instruction fine-tuning Dataset

Date

2 years ago

Size

660.72 MB

Publish URL

github.com

Paper URL

arxiv.org

License

CC BY 4.0

特色图像

SMolInstruct is a large-scale, comprehensive and high-quality chemical instruction fine-tuning dataset proposed by Ohio State University. The dataset contains 14 different chemical tasks, a total of more than 3 million samples, and covers 1.6 million unique molecules. Researchers collected data related to chemical tasks from multiple sources, covering chemical knowledge representations such as IUPAC names, SMILES representations, molecular formulas, as well as tasks such as molecular property prediction, chemical reaction prediction, and molecular description.

SMolInstruct.torrent
Seeding 1Downloading 0Completed 187Total Downloads 204
  • SMolInstruct/
    • README.md
      1.18 KB
    • README.txt
      2.35 KB
      • data/
        • SMolInstruct.zip
          660.72 MB