HyperAI

ProteinGym Protein Mutation Dataset

Date

a year ago

Publish URL

github.com

Categories

Download Help

The dataset contains a total of approximately 1.5 million missense variants from 87 DMS sequencing experiments.

paper"Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning"Using this dataset as a benchmark dataset, the results have been published in Nature Communications, a subsidiary of Nature