Machine learning boosts type 1 diabetes genetic risk prediction
Researchers at the University of California San Diego have developed a new machine learning model that significantly improves the accuracy of predicting genetic risk for Type 1 diabetes across diverse populations. Published on April 30, 2026, in Nature Genetics, the study introduces a tool called T1GRS, which moves beyond previous limitations by analyzing complex interactions between hundreds of genetic variants to identify high-risk individuals earlier. Type 1 diabetes occurs when the immune system destroys insulin-producing cells, requiring lifelong insulin therapy. Historically, predicting who would develop the condition relied on identifying known high-risk genetic markers, a method that often missed individuals with more subtle or complex genetic profiles. The UC San Diego team addressed this by analyzing genome datasets from over 20,000 people of European ancestry with Type 1 diabetes and nearly 800,000 without the disorder. This extensive analysis confirmed risk variants at 79 previously known locations and identified 13 new loci involved in immune function and blood sugar regulation. The team also pinpointed novel variants within the major histocompatibility complex, a critical region on chromosome 6. The resulting T1GRS model incorporates non-linear interactions among 199 of these risk variants. Unlike earlier tools that performed best only for those with the highest known risk, T1GRS maintains high accuracy across a broader range of individuals. Co-first author TJ Sears noted that the model successfully identified patients who developed diabetes despite lacking the classic high-risk genetic regions, a feat previous diagnostics could not achieve. Furthermore, the analysis enabled researchers to categorize Type 1 diabetes patients into four distinct genetic sub-types, each associated with unique clinical outcomes. This classification proved robust when tested on independent datasets from the National Institutes of Health's All of Us Research Program and the National Pancreatic Organ Donor biobank. Despite being trained primarily on European ancestry data, the model maintained 87% accuracy in these external groups and successfully identified the same four sub-types in non-European populations, including African Americans. Co-first author Emily Griffin emphasized that while certain high-risk genetic blocks do not guarantee disease onset, their absence strongly indicates a very low probability of developing Type 1 diabetes. The ability to predict risk in individuals without traditional markers opens new avenues for clinical intervention. Co-first author Carolyn McGrail highlighted that broadening the pool of identified high-risk individuals allows for closer monitoring and earlier treatment. This strategy aims to reduce severe complications at diagnosis, such as diabetic ketoacidosis, and to identify eligible candidates for preventative therapies like teplizumab. The study suggests that T1GRS could soon serve as a widespread clinical screening tool, facilitating personalized treatment strategies for both children and adults. By capturing those previously missed by standard genetic risk scoring, the model represents a significant step forward in the management and prevention of Type 1 diabetes.
