Event Review | Shanghai Jiao Tong University, ICT of the Chinese Academy of Sciences, Microsoft Research Asia, and BAAI (Zhiyuan) Share Practical Insights as the 5th Meet AI Compiler Technology Salon Concludes Successfully

For this event, we were fortunate to host several AI compiler experts from Shanghai Jiao Tong University, the Institute of Computing Technology of the Chinese Academy of Sciences, Microsoft Research Asia, and the Beijing Academy of Artificial Intelligence. They presented their latest research results and drew on extensive practical experience to deliver in-depth yet accessible talks to the hundreds of participants in attendance.

Although the day of the event was very hot, the participants' enthusiasm did not diminish at all. The on-site discussion was lively and prompted everyone to think more deeply about AI compiler technology. As an open and inclusive community, HyperAI is very happy to bring everyone together and provide a high-quality academic exchange platform that promotes the development of AI compiler technology.

We welcome more friends to join our AI Compiler Family and explore the infinite possibilities of AI compiler technology with us!

Event content review

Below is a brief introduction to each talk, along with its video recording.

Talk topic: MLCEngine: A Universal LLM Deployment Engine

Summary: This talk introduces MLCEngine, an LLM engine that can be deployed universally across platforms. MLCEngine provides high-throughput, low-latency LLM serving on servers and also supports seamless deployment of today's high-quality large language models in a variety of local environments.

Talk video:

【2024 Meet AI Compiler】Feng Siyuan - MLCEngine: A Universal LLM Deployment Engine (bilibili): https://www.bilibili.com/video/BV1Ji421Y7je/?vd_source=5e54209e1f8c68b7f1dc3df8aabf856c

Talk topic: ElasticRoom: Multi-Tenant DNN Inference Engine via Co-design with Resource-constrained Compilation and Strong Priority Scheduling

Summary: GPU resource partitioning mechanisms in runtime software are widely used in job schedulers and multi-tenant computing systems to improve resource utilization and throughput. However, when facing batch heterogeneous DNN inference requests, existing GPU resource partitioning mechanisms cannot simultaneously improve GPU resource utilization and ensure low latency for real-time requests. We propose ElasticRoom, a multi-tenant DNN inference engine that builds resource-constrained compilation on top of TVM and uses priority scheduling to achieve both high GPU utilization and low latency for real-time requests.
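To make the idea of strict priority scheduling concrete, here is a minimal Python sketch. It is not ElasticRoom's implementation and does not model GPU partitioning or TVM compilation; the class `StrictPriorityQueue`, the priority levels `REALTIME` and `BATCH`, and the request names are hypothetical, introduced only for illustration. It shows a single property: a real-time request is always dispatched before any queued batch request, while requests at the same level keep their arrival order.

```python
import heapq
import itertools
from dataclasses import dataclass, field

# Hypothetical priority levels: a lower value is served first.
REALTIME, BATCH = 0, 1


@dataclass(order=True)
class Request:
    priority: int                    # primary sort key
    seq: int                         # arrival order, breaks ties within a level
    name: str = field(compare=False)


class StrictPriorityQueue:
    """Always hands out the highest-priority pending request first."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()

    def submit(self, name, priority):
        heapq.heappush(self._heap, Request(priority, next(self._counter), name))

    def next_request(self):
        # Returns None when nothing is pending.
        return heapq.heappop(self._heap).name if self._heap else None


queue = StrictPriorityQueue()
queue.submit("batch-0", BATCH)
queue.submit("batch-1", BATCH)
queue.submit("rt-0", REALTIME)      # arrives last, but is served first

order = [queue.next_request() for _ in range(3)]
print(order)  # ['rt-0', 'batch-0', 'batch-1']
```

A real engine would also have to decide what happens to batch work that is already running when a real-time request arrives; this sketch only orders work that is still queued.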

Talk video:

【2024 Meet AI Compiler】Ma Lixian - ElasticRoom: Multi-Tenant DNN Inference Engine (bilibili): https://www.bilibili.com/video/BV1uE421P7zm/?vd_source=5e54209e1f8c68b7f1dc3df8aabf856c

Talk topic: Innovative Practices in FlagGems, a Triton-Based Operator Library for Large Models

Summary: Based on OpenAI's Triton language, we developed FlagGems, a high-performance general-purpose operator library that accelerates inference and training for large models under the PyTorch framework. To suit Triton's programming characteristics, we applied two technical innovations, runtime optimization and automatic code generation, which extend the expressiveness of the operators and improve their performance.

Talk video:

https://www.bilibili.com/video/BV1ES421R7o7/?vd_source=5e54209e1f8c68b7f1dc3df8aabf856c

2024 AI Compiler · Coming Soon

The 6th Meet AI Compiler Technology Salon of 2024 is expected to be held in Shanghai at the end of the year. We sincerely invite companies and community partners to take part in co-creating it in any form. Whether you recommend speakers or sponsor the venue and tea breaks, all contributions are welcome.

Let's work together to build the most active AI compiler community in China! Finally, here is a group photo from the event ❤️

Organizers and partners

HyperAI is a leading artificial intelligence and high-performance computing community in China. It aims to help developers and enthusiasts in China's data science and artificial intelligence industry learn, understand, and practice by providing infrastructure such as accelerated dataset downloads, online tutorial demonstrations, in-depth paper interpretations, and an integrated calendar of top conferences, and to build the future of artificial intelligence together with the community. The HyperAI official website currently hosts thousands of classic, high-quality public datasets and tutorials, and HyperAI operates the most active AI compiler community in China.

Visit the official website: https://hyper.ai/

OpenBayes Bayesian Computing is a leading high-performance computing service provider in China. By grafting classic software ecosystems and machine learning models onto new-generation heterogeneous chips, it provides industrial enterprises and university research groups with faster and easier-to-use data science computing products. Its products have been adopted in dozens of large-scale industrial scenarios and by leading scientific research institutes.

Visit the official website: https://openbayes.com/

The MLC.AI community was established in June 2022. Chen Tianqi, the main inventor of Apache TVM and a well-known young scholar in the field of machine learning, led the team to launch the MLC online course, which systematically introduced the key elements and core concepts of machine learning compilation.

In November 2022, through the joint efforts of MLC.AI community volunteers, the first complete Chinese TVM documentation was launched and hosted on the HyperAI official website, giving Chinese developers interested in machine learning compilation the basic resource needed to access and learn a new technology: documentation.

MLC Online Courses: https://mlc.ai/

TVM Chinese Documentation: https://tvm.hyper.ai/

The Institute of Computing Technology of the Chinese Academy of Sciences (ICT) was founded in 1956 and is the first academic institution in China dedicated to comprehensive research in computer science and technology. ICT successfully developed China's first general-purpose digital electronic computer and became a research and development base for China's high-performance computers. China's first general-purpose CPU chip was also born here.

ICT is the cradle of China's computer industry. As it has grown, it has trained several hundred of China's earliest computing technology professionals, and more than 20 academicians have worked or studied here. As disciplines and technologies have developed, several research institutes, including the Xi'an Microelectronics Institute, the Computing Center, the Software Institute, the Network Center, the Microelectronics Institute, and the Information Engineering Institute, have been spun off from ICT, and high-tech companies such as Lenovo, Sugon, Loongson, and Cambrian have been incubated here.

The Technical Committee of HPC of the China Computer Federation (abbreviated as CCF TCHPC) was established in 2005 with the approval of the China Computer Federation. As a professional committee under the federation, it is an authoritative organization that conducts academic research on high-performance computing, organizes academic conferences in the field, and provides services connecting industry, universities, and applications.

Guided by its principle and mission of "building an academic platform, promoting industrial exchanges, advancing application implementation, balancing the software and hardware ecosystem, serving industry development, and connecting industry, academia, research and application", the committee is committed to promoting research and development in China's high-performance computing field and to building a platform for academic and industrial cooperation and exchange in high-performance computing. It plays an important role in supporting scientific and technological development and innovation, promoting social progress, and enhancing China's comprehensive national strength and international competitiveness.

Get the slides: Follow the WeChat official account "HyperAI Super Neuro" and reply with the keyword "AI Compiler Beijing" to receive the speakers' complete slides.
