Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

Recently, Alfredo Canziani, assistant professor of computer science at New York University and a student of Yann LeCun, released his spring "AI course", which covers topics such as discrete probability and naive Bayes, perceptrons and logistic regression, optimization, statistics and neural natural language processing, neural network classification, recurrent neural networks and convolutional neural networks.
This week, HyperAI will broadcast the course live on B Station 24/7. Let's learn together~

Watch link:

http://live.bilibili.com/26483094

From June 24 to June 28, hyper.ai official website updates:

High-quality public datasets: 10
High-quality tutorial selection: 3
Community Article Selection: 4 articles
Popular encyclopedia entries: 5
Top conferences with deadlines in July: 4

Visit the official website:hyper.ai

Selected public datasets

1. GeoChat Instruct Remote Sensing Multimodal Instruction Tracking Dataset

The dataset contains nearly 318,000 instructions and aims to extend multimodal instruction adaptation to the remote sensing domain to train multi-task conversational assistants. The related paper results have been accepted by CVPR 2024.

Direct use:https://go.hyper.ai/CXu0K

2. RRSIS-D Large Remote Sensing Image Segmentation Dataset

The dataset contains 17,402 image-description-mask triplets covering a variety of spatial resolutions and object orientations. The related paper results have been accepted by CVPR 2024.

Direct use:https://go.hyper.ai/1VRQG

3. Earth Parser Dataset Remote Sensing Mapping Dataset

This dataset is for training and evaluating parsing methods on large, uncurated aerial LiDAR scans. The dataset contains 7 scenes covering an area of more than 7.7 square kilometers and a total of 98 million 3D points. The related paper results have been accepted by CVPR 2024.

Direct use:https://go.hyper.ai/3pFjm

4. Harvard-GF3300 Retinal Neurological Disease (Glaucoma) Dataset

This dataset is a retinal neurological disease (glaucoma) dataset including 3,300 subjects, containing 2D and 3D image data. The dataset contains an equal number of subjects from 3 major racial groups (white, black, and Asian), which avoids data imbalance issues that may confuse fair learning issues.

Direct use:https://go.hyper.ai/vIhu6

5. Dental X-ray Images for Analysis Dental X-ray Image Dataset

This dataset contains a variety of dental orthodontic tomography (OPG) X-ray images, 70 high-quality samples. By providing annotations, this dataset can be used to train and test machine learning models for dental image analysis tasks such as tooth type classification and anomaly detection.

Direct use:https://go.hyper.ai/vK9zz

6. Multi-region fracture X-ray dataset

The dataset contains fractured and non-fractured X-ray images covering all anatomical body regions including lower limbs, upper limbs, lumbar spine, hip, knee, etc. The dataset is divided into training, testing, and validation folders, with a total of 10,580 radiological images (X-ray) data.

Direct use:https://go.hyper.ai/Yk1bA

7. Fruit and Vegetable Image Recognition Dataset

The dataset contains images of 10 kinds of fruits and 26 kinds of vegetables, and each category is divided into training, testing, and validation sets, providing a diverse set for image recognition tasks.

Direct use:https://go.hyper.ai/FdfRK

8. Character Codex

The dataset contains information about 15,939 popular characters from various media types and genres. Each entry contains detailed information about the character, the media source, and unique scenes involving the character.

Direct use:https://go.hyper.ai/wf1q1

9. RepLiQA is a possible question answering dataset for benchmarking

RepLiQA is an evaluation dataset containing "context-question-answer" triplets covering 17 topics or document categories, designed to test the ability of large language models (LLMs) to find and use contextual information in provided documents.

Direct use:https://go.hyper.ai/ZkSYD

10. CS-Eval Large Model Network Security Evaluation Dataset

The dataset covers 11 major areas of network security, 42 sub-areas, and 4,369 multiple-choice questions, true-or-false questions, and knowledge extraction questions. It provides comprehensive evaluation tasks based on knowledge and practice, supports user self-evaluation, and provides reference and inspiration for the implementation of large-scale models in network security.

Direct use:https://go.hyper.ai/ziacf

For more public datasets, please visit:

https://hyper.ai/datasets

Selected Public Tutorials

1. Bioclip biological classification hierarchical prediction demo

This tutorial demo can classify a given biological image by family, genus, species, etc. It is the Gradio version of the model in the best student paper "BioCLIP: A Vision Foundation Model for the Tree of Life" of CVPR2024.

Run online:https://go.hyper.ai/OEWk1
2. InstantStyle - a consistent image generator

InstantStyle is a text-to-image generation framework developed by the InstantX team of Xiaohongshu, which achieves style transfer while maintaining the text controllability of the content. This tutorial has built the relevant environment for you, and you can experience it with one click!

Run online:https://go.hyper.ai/E6GuW

3. Generate high-quality articles in 5 seconds, Llama 3-Chinese-Chat Demo starts with one click!

This model is a Chinese chat model fine-tuned specifically for Chinese based on the Meta-Llama-3-8b-Instruct model. Compared with the original Meta-Llama-3-8b-Instruct model, it significantly reduces the problems of "Chinese questions with English replies" and mixed Chinese and English. Simply clone and start the container, and directly copy the generated API address to experience the model inference.

Run online:https://go.hyper.ai/BLHcM

Community Articles

1. The strongest iron-based superconducting magnet is born! Scientists design a new research system based on machine learning, and the magnetic field strength exceeds the previous record by 2.7 times

Scientists from the UK and Japan used machine learning technology to design a research system that combines researcher-driven and data-driven methods, and successfully created the world's strongest known iron-based superconducting magnet. This article is a detailed interpretation and sharing of the research.

View event details:https://go.hyper.ai/RxV9x

2. Li Jianping, Vice President of Peking University Hospital: Using AI to solve the difficulties, bottlenecks and pain points in clinical myocardial ischemia prediction

At the Beijing Zhiyuan Conference, Professor Li Jianping, Vice President of Peking University First Hospital and Director of the Institute of Cardiovascular Medicine, shared AI's new exploration and practice in the diagnosis of coronary heart disease and the prediction of clinical myocardial ischemia under the title of "Methods and Difficulties of Clinical Myocardial Ischemia Prediction". This provides a new idea for the diagnosis and treatment of patients with coronary heart disease, and expands the focus from the heart to the kidneys, which is expected to make AI more valuable in clinical medicine. This article is a detailed summary of the speech.

Read the full interview:https://go.hyper.ai/5X9jM

A cell model with 310 million parameters is here! In a Nature journal, a Tsinghua University team released scFoundation: Simultaneous modeling of 20,000 genes

The Tsinghua University research team proposed a large cell model called sc-Foundation, which is trained based on the gene expression data of 50 million cells, has 100 million parameters, and can process about 20,000 genes at the same time. As a basic model, it has shown excellent performance improvement in a variety of biomedical downstream tasks such as cell sequencing depth enhancement, cell drug response prediction, and cell perturbation prediction. This article is a detailed interpretation and sharing of the research.

View the full report:https://go.hyper.ai/v5i5K

4. How do AI practitioners do science? Zhou Hao from Tsinghua University AIR: Cross-border exploration from text generation to protein design

Recently, Professor Zhou Hao from the Institute of Intelligent Industry of Tsinghua University, as a computer practitioner, shared with everyone the multiple challenges that AI people encounter in protein design, and described the latest cutting-edge research in the field of protein from three aspects: data structure, generation algorithm, and protein pre-training. This article reports on Professor Zhou Hao's in-depth sharing.

View the full report:https://go.hyper.ai/PTyAp

Popular Encyclopedia Articles

1. Scaling Theorem Scaling Law

2. Reciprocal ranking fusion RRF

3. Neural Radiance Field (NeRF)

4. Large-scale Multi-task Language Understanding (MMLU)

5. Kolmogorov-Arnold Representation Theorem

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://hyper.ai/wiki

Station B live broadcast preview

AIfredo Canziani is an assistant professor of computer science at New York University and a student of Yann LeCun. Recently, he released his spring "AI course" video. The knowledge points taught in each chapter include: discrete probability and naive Bayes; perceptron and logistic regression; optimization, statistics and neural natural language processing; neural network classification, etc. This week, Super Neural TV will broadcast the course 24/7.

The following table is a preview of the content selected by the editor↓↓↓

date	time	content
Monday, July 1	18:00	Part 1: Introduction to Naive Bayes
Tuesday, July 2	18:00	Part 2 Naive Bayes Classification
Wednesday, July 3	18:00	Part 3 Naive Bayes Parameter Estimation and Laplace Smoothing
Thursday, July 4	18:00	Part 4. Evaluation of Binary Classifiers
Friday, July 5	18:00	Part 5 Multiclass Perceptrons Binary and Multiclass Logistic Regression
Saturday, July 6	18:00	Part 6 Optimization and Gradient Ascent
Sunday, July 7	18:00	Alfredo Canziani's talk on Energy-Based Self-Supervised Learning

Super Neuro TV broadcasts live 24/7. Click to get the "electronic pickles" in the AI field:

http://live.bilibili.com/26483094

One-stop tracking of top AI academic conferences:https://hyper.ai/events

The above is all the content of this week’s editor’s selection. If you have resources that you want to include on the hyper.ai official website, you are also welcome to leave a message or submit an article to tell us!

See you next week!

About HyperAI

HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China. We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:

Provide domestic accelerated download nodes for 1300+ public data sets
Contains 400+ classic and popular online tutorials
Interpreting 100+ AI4Science paper cases
Support 500+ related terms search
Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai

HyperAI

Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

2 years ago

Information

Artificial Intelligence

Dataset

Machine Learning

Deep Learning

Watch link:

http://live.bilibili.com/26483094

From June 24 to June 28, hyper.ai official website updates:

High-quality public datasets: 10
High-quality tutorial selection: 3
Community Article Selection: 4 articles
Popular encyclopedia entries: 5
Top conferences with deadlines in July: 4

Visit the official website:hyper.ai

Selected public datasets

1. GeoChat Instruct Remote Sensing Multimodal Instruction Tracking Dataset

Direct use:https://go.hyper.ai/CXu0K

2. RRSIS-D Large Remote Sensing Image Segmentation Dataset

The dataset contains 17,402 image-description-mask triplets covering a variety of spatial resolutions and object orientations. The related paper results have been accepted by CVPR 2024.

Direct use:https://go.hyper.ai/1VRQG

3. Earth Parser Dataset Remote Sensing Mapping Dataset

Direct use:https://go.hyper.ai/3pFjm

4. Harvard-GF3300 Retinal Neurological Disease (Glaucoma) Dataset

Direct use:https://go.hyper.ai/vIhu6

5. Dental X-ray Images for Analysis Dental X-ray Image Dataset

Direct use:https://go.hyper.ai/vK9zz

6. Multi-region fracture X-ray dataset

Direct use:https://go.hyper.ai/Yk1bA

7. Fruit and Vegetable Image Recognition Dataset

Direct use:https://go.hyper.ai/FdfRK

8. Character Codex

Direct use:https://go.hyper.ai/wf1q1

9. RepLiQA is a possible question answering dataset for benchmarking

Direct use:https://go.hyper.ai/ZkSYD

10. CS-Eval Large Model Network Security Evaluation Dataset

Direct use:https://go.hyper.ai/ziacf

For more public datasets, please visit:

https://hyper.ai/datasets

Selected Public Tutorials

1. Bioclip biological classification hierarchical prediction demo

Run online:https://go.hyper.ai/OEWk1
2. InstantStyle - a consistent image generator

Run online:https://go.hyper.ai/E6GuW

3. Generate high-quality articles in 5 seconds, Llama 3-Chinese-Chat Demo starts with one click!

Run online:https://go.hyper.ai/BLHcM

Community Articles

View event details:https://go.hyper.ai/RxV9x

2. Li Jianping, Vice President of Peking University Hospital: Using AI to solve the difficulties, bottlenecks and pain points in clinical myocardial ischemia prediction

Read the full interview:https://go.hyper.ai/5X9jM

A cell model with 310 million parameters is here! In a Nature journal, a Tsinghua University team released scFoundation: Simultaneous modeling of 20,000 genes

View the full report:https://go.hyper.ai/v5i5K

4. How do AI practitioners do science? Zhou Hao from Tsinghua University AIR: Cross-border exploration from text generation to protein design

View the full report:https://go.hyper.ai/PTyAp

Popular Encyclopedia Articles

1. Scaling Theorem Scaling Law

2. Reciprocal ranking fusion RRF

3. Neural Radiance Field (NeRF)

4. Large-scale Multi-task Language Understanding (MMLU)

5. Kolmogorov-Arnold Representation Theorem

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://hyper.ai/wiki

Station B live broadcast preview

The following table is a preview of the content selected by the editor↓↓↓

date	time	content
Monday, July 1	18:00	Part 1: Introduction to Naive Bayes
Tuesday, July 2	18:00	Part 2 Naive Bayes Classification
Wednesday, July 3	18:00	Part 3 Naive Bayes Parameter Estimation and Laplace Smoothing
Thursday, July 4	18:00	Part 4. Evaluation of Binary Classifiers
Friday, July 5	18:00	Part 5 Multiclass Perceptrons Binary and Multiclass Logistic Regression
Saturday, July 6	18:00	Part 6 Optimization and Gradient Ascent
Sunday, July 7	18:00	Alfredo Canziani's talk on Energy-Based Self-Supervised Learning

Super Neuro TV broadcasts live 24/7. Click to get the "electronic pickles" in the AI field:

http://live.bilibili.com/26483094

One-stop tracking of top AI academic conferences:https://hyper.ai/events

See you next week!

About HyperAI

Provide domestic accelerated download nodes for 1300+ public data sets
Contains 400+ classic and popular online tutorials
Interpreting 100+ AI4Science paper cases
Support 500+ related terms search
Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai

Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

2 years ago

Information

Artificial Intelligence

Dataset

Machine Learning

Deep Learning

Watch link:

http://live.bilibili.com/26483094

From June 24 to June 28, hyper.ai official website updates:

High-quality public datasets: 10
High-quality tutorial selection: 3
Community Article Selection: 4 articles
Popular encyclopedia entries: 5
Top conferences with deadlines in July: 4

Visit the official website:hyper.ai

Selected public datasets

1. GeoChat Instruct Remote Sensing Multimodal Instruction Tracking Dataset

Direct use:https://go.hyper.ai/CXu0K

2. RRSIS-D Large Remote Sensing Image Segmentation Dataset

The dataset contains 17,402 image-description-mask triplets covering a variety of spatial resolutions and object orientations. The related paper results have been accepted by CVPR 2024.

Direct use:https://go.hyper.ai/1VRQG

3. Earth Parser Dataset Remote Sensing Mapping Dataset

Direct use:https://go.hyper.ai/3pFjm

4. Harvard-GF3300 Retinal Neurological Disease (Glaucoma) Dataset

Direct use:https://go.hyper.ai/vIhu6

5. Dental X-ray Images for Analysis Dental X-ray Image Dataset

Direct use:https://go.hyper.ai/vK9zz

6. Multi-region fracture X-ray dataset

Direct use:https://go.hyper.ai/Yk1bA

7. Fruit and Vegetable Image Recognition Dataset

Direct use:https://go.hyper.ai/FdfRK

8. Character Codex

Direct use:https://go.hyper.ai/wf1q1

9. RepLiQA is a possible question answering dataset for benchmarking

Direct use:https://go.hyper.ai/ZkSYD

10. CS-Eval Large Model Network Security Evaluation Dataset

Direct use:https://go.hyper.ai/ziacf

For more public datasets, please visit:

https://hyper.ai/datasets

Selected Public Tutorials

1. Bioclip biological classification hierarchical prediction demo

Run online:https://go.hyper.ai/OEWk1
2. InstantStyle - a consistent image generator

Run online:https://go.hyper.ai/E6GuW

3. Generate high-quality articles in 5 seconds, Llama 3-Chinese-Chat Demo starts with one click!

Run online:https://go.hyper.ai/BLHcM

Community Articles

View event details:https://go.hyper.ai/RxV9x

2. Li Jianping, Vice President of Peking University Hospital: Using AI to solve the difficulties, bottlenecks and pain points in clinical myocardial ischemia prediction

Read the full interview:https://go.hyper.ai/5X9jM

A cell model with 310 million parameters is here! In a Nature journal, a Tsinghua University team released scFoundation: Simultaneous modeling of 20,000 genes

View the full report:https://go.hyper.ai/v5i5K

4. How do AI practitioners do science? Zhou Hao from Tsinghua University AIR: Cross-border exploration from text generation to protein design

View the full report:https://go.hyper.ai/PTyAp

Popular Encyclopedia Articles

1. Scaling Theorem Scaling Law

2. Reciprocal ranking fusion RRF

3. Neural Radiance Field (NeRF)

4. Large-scale Multi-task Language Understanding (MMLU)

5. Kolmogorov-Arnold Representation Theorem

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://hyper.ai/wiki

Station B live broadcast preview

The following table is a preview of the content selected by the editor↓↓↓

date	time	content
Monday, July 1	18:00	Part 1: Introduction to Naive Bayes
Tuesday, July 2	18:00	Part 2 Naive Bayes Classification
Wednesday, July 3	18:00	Part 3 Naive Bayes Parameter Estimation and Laplace Smoothing
Thursday, July 4	18:00	Part 4. Evaluation of Binary Classifiers
Friday, July 5	18:00	Part 5 Multiclass Perceptrons Binary and Multiclass Logistic Regression
Saturday, July 6	18:00	Part 6 Optimization and Gradient Ascent
Sunday, July 7	18:00	Alfredo Canziani's talk on Energy-Based Self-Supervised Learning

Super Neuro TV broadcasts live 24/7. Click to get the "electronic pickles" in the AI field:

http://live.bilibili.com/26483094

One-stop tracking of top AI academic conferences:https://hyper.ai/events

See you next week!

About HyperAI

Provide domestic accelerated download nodes for 1300+ public data sets
Contains 400+ classic and popular online tutorials
Interpreting 100+ AI4Science paper cases
Support 500+ related terms search
Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai

Command Palette

Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

Command Palette

Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

Related News

Meta Proposes AI Data Scientists, and Autodata Builds high-quality training/evaluation datasets.

Can Emojis Control Speech Generation? Irodori-TTS Is a Japanese TTS Based on the RF-DiT Architecture; Eczema and Tinea Skin Disease Datasets: Supporting Medical Image Classification and Transfer learning.

Fast and Accurate! Cohere Releases open-source Transcription Model; Accurate Parsing of Complex Scenarios: Chandra-ocr-2 Visual Language Model Achieves Precise OCR.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

Tencent open-sources Hy-MT1.5 Translation Model: 440MB Achieves top-tier Translation Capabilities; MIT Jointly Releases MathNet: a Multimodal Mathematical Inference Benchmark Covering 27,000 Real Olympiad Math problems.

MiniCPM5-1B, Trained Using RL+OPD, Achieves state-of-the-art (SOTA) Performance on Multiple Complex Tasks; the CHI-Bench Dataset for Evaluating Medical Agents, Designed for Automation of Complex Healthcare Processes, Has Been released.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Zero-sampling TTS Breakthrough! A Few Seconds of Reference Audio, OmniVoice Helps You Easily Clone Hundreds of Languages; 17 Languages All in One Go: MDPbench Solves the Major Problem of Parsing low-resource Text systems.

Extremely Lightweight, yet With Undiminished Image Quality! ERNIE-Image-Turbo: Say Goodbye to Long Waits, lightning-fast Speed; Introducing dual-dimensional Metrics of Perception and Cognition: Alibaba's Unified Multimodal Parsing and Evaluation Dataset OmniParsingBench Is Now online.

Command Palette

Exclusive Chinese Subtitles! LeCun Student Alfredo's Spring AI Course Begins; CVPR'24 Remote Sensing Dataset Download

Related News

Meta Proposes AI Data Scientists, and Autodata Builds high-quality training/evaluation datasets.

Can Emojis Control Speech Generation? Irodori-TTS Is a Japanese TTS Based on the RF-DiT Architecture; Eczema and Tinea Skin Disease Datasets: Supporting Medical Image Classification and Transfer learning.

Fast and Accurate! Cohere Releases open-source Transcription Model; Accurate Parsing of Complex Scenarios: Chandra-ocr-2 Visual Language Model Achieves Precise OCR.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

Tencent open-sources Hy-MT1.5 Translation Model: 440MB Achieves top-tier Translation Capabilities; MIT Jointly Releases MathNet: a Multimodal Mathematical Inference Benchmark Covering 27,000 Real Olympiad Math problems.

MiniCPM5-1B, Trained Using RL+OPD, Achieves state-of-the-art (SOTA) Performance on Multiple Complex Tasks; the CHI-Bench Dataset for Evaluating Medical Agents, Designed for Automation of Complex Healthcare Processes, Has Been released.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Zero-sampling TTS Breakthrough! A Few Seconds of Reference Audio, OmniVoice Helps You Easily Clone Hundreds of Languages; 17 Languages All in One Go: MDPbench Solves the Major Problem of Parsing low-resource Text systems.

Extremely Lightweight, yet With Undiminished Image Quality! ERNIE-Image-Turbo: Say Goodbye to Long Waits, lightning-fast Speed; Introducing dual-dimensional Metrics of Perception and Cognition: Alibaba's Unified Multimodal Parsing and Evaluation Dataset OmniParsingBench Is Now online.

Related News

Meta Proposes AI Data Scientists, and Autodata Builds high-quality training/evaluation datasets.

Can Emojis Control Speech Generation? Irodori-TTS Is a Japanese TTS Based on the RF-DiT Architecture; Eczema and Tinea Skin Disease Datasets: Supporting Medical Image Classification and Transfer learning.

Fast and Accurate! Cohere Releases open-source Transcription Model; Accurate Parsing of Complex Scenarios: Chandra-ocr-2 Visual Language Model Achieves Precise OCR.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

Tencent open-sources Hy-MT1.5 Translation Model: 440MB Achieves top-tier Translation Capabilities; MIT Jointly Releases MathNet: a Multimodal Mathematical Inference Benchmark Covering 27,000 Real Olympiad Math problems.

MiniCPM5-1B, Trained Using RL+OPD, Achieves state-of-the-art (SOTA) Performance on Multiple Complex Tasks; the CHI-Bench Dataset for Evaluating Medical Agents, Designed for Automation of Complex Healthcare Processes, Has Been released.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Zero-sampling TTS Breakthrough! A Few Seconds of Reference Audio, OmniVoice Helps You Easily Clone Hundreds of Languages; 17 Languages All in One Go: MDPbench Solves the Major Problem of Parsing low-resource Text systems.

Extremely Lightweight, yet With Undiminished Image Quality! ERNIE-Image-Turbo: Say Goodbye to Long Waits, lightning-fast Speed; Introducing dual-dimensional Metrics of Perception and Cognition: Alibaba's Unified Multimodal Parsing and Evaluation Dataset OmniParsingBench Is Now online.

Related News

Meta Proposes AI Data Scientists, and Autodata Builds high-quality training/evaluation datasets.

Can Emojis Control Speech Generation? Irodori-TTS Is a Japanese TTS Based on the RF-DiT Architecture; Eczema and Tinea Skin Disease Datasets: Supporting Medical Image Classification and Transfer learning.

Fast and Accurate! Cohere Releases open-source Transcription Model; Accurate Parsing of Complex Scenarios: Chandra-ocr-2 Visual Language Model Achieves Precise OCR.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

Tencent open-sources Hy-MT1.5 Translation Model: 440MB Achieves top-tier Translation Capabilities; MIT Jointly Releases MathNet: a Multimodal Mathematical Inference Benchmark Covering 27,000 Real Olympiad Math problems.

MiniCPM5-1B, Trained Using RL+OPD, Achieves state-of-the-art (SOTA) Performance on Multiple Complex Tasks; the CHI-Bench Dataset for Evaluating Medical Agents, Designed for Automation of Complex Healthcare Processes, Has Been released.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Zero-sampling TTS Breakthrough! A Few Seconds of Reference Audio, OmniVoice Helps You Easily Clone Hundreds of Languages; 17 Languages All in One Go: MDPbench Solves the Major Problem of Parsing low-resource Text systems.

Extremely Lightweight, yet With Undiminished Image Quality! ERNIE-Image-Turbo: Say Goodbye to Long Waits, lightning-fast Speed; Introducing dual-dimensional Metrics of Perception and Cognition: Alibaba's Unified Multimodal Parsing and Evaluation Dataset OmniParsingBench Is Now online.