With Only 3B Active Parameters, Performance Comparable to GPT-4o: Qwen3 Gets a Late-Night Update, and Our Hands-On Test Is Here!

a year ago

In the early morning of July 29, the Qwen team announced another major update: the already highly acclaimed Qwen3-30B-A3B model has a new version, Qwen3-30B-A3B-Instruct-2507. Machine learning enthusiast Vaibhav (VB) Srivastav was the first to share his feedback: "The latest Qwen3-30B-A3B-2507 runs extremely fast on a Mac equipped with MLX."

According to the official data, the new model runs in non-thinking mode and extends long-context understanding to 256K tokens. With only 3B parameters activated, it achieves performance comparable to top closed-source models such as Gemini 2.5-Flash (non-thinking) and GPT-4o. It also shows significant improvements in instruction following, logical reasoning, text comprehension, mathematics, science, programming, and tool use.
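To make the "only 3B activated" claim concrete, here is a rough back-of-the-envelope sketch. The parameter counts are nominal values read off the model name "30B-A3B" (about 30B total, about 3B active per token), and the 16-bit weight format is an assumption; the exact figures for the released checkpoint may differ.

```python
# Back-of-the-envelope numbers for a mixture-of-experts model.
# All three constants are assumptions, not official figures.
TOTAL_PARAMS = 30e9    # assumption: ~30B total parameters (from the name "30B")
ACTIVE_PARAMS = 3e9    # assumption: ~3B parameters activated per token ("A3B")
BYTES_PER_PARAM = 2    # assumption: 16-bit weights (bf16/fp16), no quantization


def active_fraction(total: float, active: float) -> float:
    """Share of the weights that take part in computing one token."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory needed just to hold the weights, in GB (10^9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    share = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    memory = weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM)
    print(f"active share per token: {share:.0%}")
    print(f"weights held in memory: {memory:.0f} GB")
```

Under these assumptions, only about a tenth of the weights are used per token, which helps explain the fast inference. However, all of the weights (roughly 60 GB at 16 bits, before any cache or activations) still have to be loaded.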

The "One-click Deployment of Qwen3-30B-A3B-Instruct-2507" tutorial is now available in the public tutorials section of OpenBayes, and you can try the demo by cloning it with one click. We ran a hands-on test, asking whether there is a connection between two extreme weather events: the heavy rain across many parts of Beijing and the typhoon making landfall in Shanghai. The non-thinking-mode model quickly answered from multiple angles.

In addition, we have prepared a compute bonus for new users. Register with the invitation code "Qwen3-2507" to get 2 hours of dual-GPU RTX A6000 usage time (valid for 1 month). The quantity is limited, so don't miss it!

Tutorial Link:

https://go.hyper.ai/s7Y7h

Demo Run

1. On the hyper.ai homepage, open the "Tutorials" page, choose "One-click Deployment of Qwen3-30B-A3B-Instruct-2507", and click "Run this tutorial online".

2. After the page redirects, click "Clone" in the upper right corner to clone the tutorial into your own container.

3. Select "NVIDIA RTX A6000-2" and the "PyTorch" image. OpenBayes now offers new billing options: choose "Pay as you go" or "Daily/Weekly/Monthly" according to your needs. Click "Continue." New users who register through the invitation link below receive 4 hours of free RTX 4090 time and 5 hours of free CPU time!

HyperAI exclusive invitation link (copy and open in browser):

https://go.openbayes.com/9S6Dr

4. Wait for resources to be allocated. The first clone takes about 2 minutes. When the status changes to "Running", click the arrow next to "API Address" to open the WebUI page. Note that users must complete real-name verification before they can use API address access.

Effect Demonstration

1. Extreme weather has been occurring frequently recently. After Beijing experienced a series of heavy rains, Shanghai was hit by a typhoon. Let's ask Qwen3-30B-A3B-Instruct-2507 whether there is any connection between the Shanghai typhoon and the Beijing heavy rains, and see what it says.

* After opening the API address, if "Model" is not displayed in the upper left corner, the model is still initializing. Because the model is large, wait about 2-3 minutes and then refresh the page.

2. This version is a new non-thinking-mode model, and it provides an objective analysis from multiple perspectives.
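The "wait 2-3 minutes and refresh" advice in the note above can be scripted as a simple poll-with-delay loop. This is a minimal sketch, not something the tutorial provides. The URL is a hypothetical placeholder, the helper names are made up for illustration, and an HTTP 200 only shows that the page responds. You still need to check in the browser whether "Model" appears.

```python
import time
import urllib.request
from typing import Callable


def page_is_up(url: str, timeout: float = 10.0) -> bool:
    """Return True if the URL answers with HTTP 200, False on any network error."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:  # URLError, HTTPError, and timeouts are all OSError subclasses
        return False


def wait_until_ready(check: Callable[[], bool], attempts: int = 12,
                     delay_s: float = 15.0,
                     sleep: Callable[[float], None] = time.sleep) -> bool:
    """Call check() up to `attempts` times, sleeping `delay_s` between tries."""
    for i in range(attempts):
        if check():
            return True
        if i < attempts - 1:  # no need to sleep after the final attempt
            sleep(delay_s)
    return False


if __name__ == "__main__":
    # Hypothetical placeholder: paste the "API Address" shown for your container.
    api_address = "https://example.invalid/"
    ready = wait_until_ready(lambda: page_is_up(api_address), attempts=2, delay_s=1.0)
    print("ready" if ready else "still initializing")
```

The defaults (12 attempts, 15 seconds apart) cover a little under three minutes, in line with the waiting time mentioned above. The check is passed in as a function, so a stricter readiness test can replace `page_is_up` without changing the loop.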

That is the tutorial recommended in this issue. You are welcome to try it yourself!

Tutorial Link:

https://go.hyper.ai/s7Y7h
