HyperAIHyperAI

Command Palette

Search for a command to run...

GeMS Chemical Mass Spectrometry Dataset

Date

5 months ago

Size

470.6 GB

GeMS is a chemical mass spectrometry dataset that focuses on the storage and analysis of mass spectrometry (MS/MS) data, aiming to provide large-scale raw data support for molecular characterization, compound identification and metabolomics research. The dataset integrates a large number of unlabeled spectra from the Global Natural Products Social Mass Spectrum Library (GNPS) and is the core data foundation of the DreaMS project. The related paper results are:Self-supervised learning of molecular representations from millions of tandem mass spectra using DreaMS".

The dataset contains hundreds of millions of mass spectra (such as 2 billion in the GeMS-C1 subset), including structured numerical data (mass-to-charge ratio-intensity pairs of mass spectra) and metadata (such as spectral sources, experimental conditions, etc.). It is one of the largest public mass spectrometry datasets currently available and can support ultra-large-scale model training.

GeMS.torrent
Seeding 1Downloading 0Completed 88Total Downloads 200
  • GeMS/
    • README.md
      1.56 KB
    • README.txt
      3.12 KB
      • data/
        • GeMS.zip
          470.6 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp