AlphaFold3 Dependency Databases Have Been Packaged and Launched! IC-Light, an ICLR Full-Score Paper, Accurately Identifies Lighting and Tone Features

2 years ago

AlphaFold2 has caused a sensation in the AI for Science (AI4S) field since its release, and its developers shared this year's Nobel Prize in Chemistry. Its successor, AlphaFold3, predicts not only the structure of proteins but also the structure of their complexes with other biological molecules, including how ligands (small molecules) and nucleic acids (DNA and RNA) come together and interact.

Just last month, Google DeepMind open-sourced the AlphaFold3 model weights and its dependency databases for academic research. HyperAI has now made the AlphaFold3 dependency databases available online. You are welcome to try out the breakthroughs brought by AlphaFold3 while reading the paper!

Online use: https://go.hyper.ai/wVItz

Updates on the hyper.ai official website from December 9 to December 13:

* High-quality public datasets: 10

* Selected high-quality tutorials: 3

* Selected community articles: 5

* Popular encyclopedia entries: 5

* Top conferences with deadlines in January: 9

Visit the official website: hyper.ai

Selected Public Datasets

1. AlphaFold3 Dependency Databases

This collection packages the protein and RNA databases that AlphaFold3 relies on, nine in total: BFD small, MGnify, PDB, PDB seqres, UniProt, UniRef90, NT, RFam and RNACentral.

Direct use: https://go.hyper.ai/wVItz

2. Mol-Instructions Large-scale Biomolecule Instruction Dataset

The dataset contains three types of instructions: molecule-oriented instructions, protein-oriented instructions, and biomolecule text instructions. It aims to provide rich instruction data to enhance the understanding and prediction capabilities of large language models in the field of biomolecules.

Direct use: https://go.hyper.ai/Gut1y

3. CoSQL Conversational Text to SQL Dataset

CoSQL contains 3k+ dialogues with a total of 10k+ annotated SQL queries, spanning 200 databases. The databases used by the different data splits do not overlap, which makes it possible to test how robustly a model handles databases it has never seen.

Direct use: https://go.hyper.ai/9Blzy
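The "no shared databases between splits" property described above can be checked mechanically. The following is a minimal sketch, not part of CoSQL's tooling: the function name `overlapping_databases`, the split names, and the database ids are all hypothetical and only illustrate the idea.

```python
from itertools import combinations


def overlapping_databases(splits):
    """Map each pair of splits that share a database to the shared ids."""
    overlaps = {}
    for (name_a, dbs_a), (name_b, dbs_b) in combinations(splits.items(), 2):
        shared = set(dbs_a) & set(dbs_b)
        if shared:
            overlaps[(name_a, name_b)] = shared
    return overlaps


# Hypothetical toy splits; the database ids are made up for illustration.
toy_splits = {
    "train": ["concert_singer", "pets_1"],
    "dev": ["car_1"],
    "test": ["flight_2"],
}
# Same splits, but "dev" now leaks a database that also appears in "train".
leaky_splits = {**toy_splits, "dev": ["car_1", "pets_1"]}

print(overlapping_databases(toy_splits))    # {}
print(overlapping_databases(leaky_splits))  # {('train', 'dev'): {'pets_1'}}
```

An empty result means every database belongs to exactly one split, so evaluation queries always target schemas the model did not see during training.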

4. QAngaroo Multi-step Reasoning Reading Comprehension Dataset

The dataset consists of two parts, WikiHop and MedHop. It is intended to support reading comprehension methods that can perform multi-hop reasoning, where facts scattered across different documents must be combined over several reasoning steps to derive a new fact.

Direct use: https://go.hyper.ai/u1qRw
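To make multi-hop reasoning concrete, here is a toy sketch. It is not the QAngaroo data format: the helper `multi_hop`, the `(entity, relation)` fact encoding, and the two miniature "documents" are assumptions chosen only to show that the answer requires chaining facts from more than one document.

```python
def multi_hop(facts, start, relations):
    """Follow a chain of relations from a start entity, one hop per relation."""
    entity = start
    for relation in relations:
        entity = facts.get((entity, relation))
        if entity is None:
            return None  # the chain breaks: a needed fact is missing
    return entity


# Each hypothetical "document" contributes one fact: (subject, relation) -> object.
doc_a = {("Hanging Gardens of Mumbai", "located_in"): "Mumbai"}
doc_b = {("Mumbai", "country"): "India"}

question = ("Hanging Gardens of Mumbai", ["located_in", "country"])

# Neither document alone answers the question...
print(multi_hop(doc_a, *question))  # None
# ...but combining the facts from both documents does.
print(multi_hop({**doc_a, **doc_b}, *question))  # India
```

Real systems must extract such facts from free text first; the sketch only shows the chaining step that gives the task its name.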

5. TCM Ancient Books: Traditional Chinese Medicine Ancient Books Dataset

This dataset contains about 700 ancient Chinese medicine texts, covering medical classics from the pre-Qin period to the late Qing Dynasty and the Republic of China. These texts include not only medical theory, prescriptions, and pharmacology, but also a wealth of clinical cases and encyclopedic medical knowledge.

Direct use: https://go.hyper.ai/8Vh6A

6. IndustryCorpus2 Healthcare Dataset Subset

This dataset is a high-quality data resource library specifically for research and application in the field of medical health. It has undergone a rigorous screening and cleaning process to ensure the accuracy and reliability of the data. It covers a wide range of data types in the field of medical health, including medical records, medical literature, and patient feedback, providing researchers and developers with a comprehensive perspective to explore and innovate.

Direct use: https://go.hyper.ai/G9qn2

7. P-MMEval Multi-language Multi-task Benchmark Dataset

The dataset contains 3 basic natural language processing (NLP) datasets and 5 advanced capability-specific datasets, covering tasks such as code generation, knowledge understanding, mathematical reasoning, logical reasoning, and instruction following.

Direct use: https://go.hyper.ai/qbzhv

8. ShenNong TCM Dataset

This dataset contains more than 110,000 instruction samples generated with an entity-centric self-instruct method. It focuses on the core entities and different intent scenarios in traditional Chinese medicine. It can improve a model's ability to answer questions about traditional Chinese medicine, and can also assist in traditional Chinese medicine diagnosis and provide personalized medical advice.

Direct use: https://go.hyper.ai/Okruv

9. DS-1000 Code Generation Benchmark Dataset

The dataset contains 1k real-world data science questions from StackOverflow, covering 7 widely used Python data science libraries, including NumPy, Pandas, and TensorFlow.

Direct use: https://go.hyper.ai/AL4h0

10. IndustryCorpus2-tourism-geography Tourism Geography Dataset

This dataset is the tourism geography subset of IndustryCorpus2. It covers a wide range of data types in the field, including attraction introductions, travel guides, tourist reviews, and geographic information, and supports research and applications such as natural language processing, machine learning, data mining, and tourism recommendation systems.

Direct use: https://go.hyper.ai/FIAM9

Selected Public Tutorials

1. Allegro video generation demo

Allegro converts plain text input into high-definition video at 720p resolution and 15 frames per second, with a maximum video length of 6 seconds. The model performs well in video synthesis, in terms of both visual quality and temporal coherence.

This tutorial covers model inference. Because the model takes a long time to generate a video, the tutorial is set up to generate a 5-second video.

Run online: https://go.hyper.ai/MgUVZ

2. IC-Light v2: AI lighting control upgrade demo

IC-Light stands for Imposing Consistent Light, a project that uses machine learning models to relight images. This tutorial covers the upgraded version, IC-Light v2. Compared with the original IC-Light, this version is trained on the Flux model, which enables it to identify the lighting and tone characteristics of an image more accurately and to achieve a more detailed and realistic fusion effect.

Click the link below and follow the tutorial to control the lighting effects in your image.

Run online: https://go.hyper.ai/hg0cM

3. Hunyuan3D: Generate 3D assets in just 10 seconds

Hunyuan3D is a 3D generative diffusion model that comes in a lightweight version and a standard version, both of which support generating high-quality 3D assets from text and image inputs. In qualitative and quantitative evaluations across multiple dimensions, Hunyuan3D-1.0 performs very well in terms of geometric detail, texture detail, texture-geometry consistency, 3D plausibility, and instruction following.

This tutorial uses the lightweight version of Hunyuan3D. Click the link below and follow the tutorial instructions to try 3D model generation.

Run online: https://go.hyper.ai/Rsrno

Community Articles

1. AI godfather Hinton is in charge, and materials startup CuspAI has become one of the most noteworthy British startups

As a startup, CuspAI should not be underestimated. Its seed round raised 30 million US dollars, making it one of the largest seed rounds in Europe that year. In addition, machine learning expert Max Welling is one of the company's co-founders, and Geoffrey Hinton, winner of both the Nobel Prize and the Turing Award, is a board advisor to the company. This article is a detailed introduction to CuspAI.

View the full report: https://go.hyper.ai/3fQFG

2. Cracking the "black box" problem of time series prediction! Huazhong University of Science and Technology proposed CGS-Mask to reveal the key indicators of patient survival rate

As AI technology becomes widely used in daily life, the interpretability of models has become an urgent problem, and it is particularly prominent in time series prediction tasks. To make time series prediction a "visible" process, Lu Feng's team at Huazhong University of Science and Technology, in collaboration with the Zomaya team at the University of Sydney and Tongji Hospital, proposed a new method, CGS-Mask. By combining time series prediction with interpretability, this method not only improves prediction accuracy but also makes the prediction results more intuitive and interpretable. This article is a detailed walkthrough of the paper.

View the full report: https://go.hyper.ai/TFEsd

3. Be the first to experience the demo! The genomic foundation model Evo is on the cover of Science, achieving prediction and generation from molecular to genomic scales

A research team from Stanford University and the Arc Institute in the United States proposed Evo, a genomic foundation model that was featured on the cover of Science. It achieves zero-shot prediction and high-precision generation across DNA, RNA, and protein tasks. The tutorial "Evo: Prediction and Generation from Molecular to Genome Scale" is now online in the HyperAI tutorial section and can be cloned with one click to try it out!

View the full report: https://go.hyper.ai/5WPGm

4. AI godfather Hinton was born into a family of geniuses, but he was a habitual dropout. He won the Turing Award and the Nobel Prize in his seventies.

AI godfather Hinton was born into a family of geniuses, but he is a repeat dropout; his startup had only three people, but was acquired by Google for $44 million; he spent nearly half a century developing neural networks, but he said he regretted it... What kind of life experience made him what he is today? This article is an in-depth report on Hinton.

View the full report: https://go.hyper.ai/EHWs6

5. The first vLLM Chinese document is online! The latest version increases throughput by 2.7 times and reduces latency by 5 times, making large language model inference faster!

vLLM is a framework designed to accelerate large language model inference, with almost zero waste of KV cache memory. The latest version, v0.6.4, introduces multi-step scheduling and asynchronous output processing, further optimizing GPU utilization and improving processing efficiency. To help developers in China keep up with vLLM version updates and the latest developments, the HyperAI community has completed a Chinese localization of the vLLM documentation.

View vLLM Chinese documentation: https://vllm.hyper.ai/
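The "almost zero waste" claim comes from vLLM's PagedAttention design, which manages the KV cache in fixed-size blocks rather than reserving one contiguous region per request. The arithmetic below is a rough sketch of why that helps, not vLLM code: the function names, sequence lengths, block size of 16, and 2048-token reservation are all hypothetical values picked for illustration.

```python
import math


def reserved_slots_contiguous(seq_lens, max_len):
    """Token slots reserved if every request pre-allocates max_len slots."""
    return len(seq_lens) * max_len


def reserved_slots_paged(seq_lens, block_size):
    """Token slots reserved if each request only holds whole blocks it needs."""
    return sum(math.ceil(n / block_size) * block_size for n in seq_lens)


def waste_ratio(reserved, used):
    """Fraction of reserved slots that hold no tokens."""
    return (reserved - used) / reserved


# Hypothetical batch: actual token counts of four in-flight requests.
seq_lens = [37, 120, 512, 9]
used = sum(seq_lens)

contiguous = reserved_slots_contiguous(seq_lens, max_len=2048)
paged = reserved_slots_paged(seq_lens, block_size=16)

print(f"contiguous waste: {waste_ratio(contiguous, used):.1%}")  # 91.7%
print(f"paged waste: {waste_ratio(paged, used):.1%}")            # 3.7%
```

With block-based allocation, the only unused slots are in the last, partially filled block of each request, so the waste is bounded by one block per sequence regardless of how long the sequence could have grown.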
