HyperAI

One-click Deployment of GLM-4-9B-Chat Demo

Running ChatGLM-4-9B-Chat on OpenBayes

This tutorial is a one-click deployment demo of ChatGLM-4-9B-Chat. You only need to clone and start the container and directly copy the generated API address to experience the inference of the model.

Model Introduction

GLM-4-9B is the open source version of the latest generation of pre-trained models in the GLM-4 series launched by Zhipu AI in 2024. In the evaluation of data sets in semantics, mathematics, reasoning, code, and knowledge, GLM-4-9B and its human preference-aligned version GLM-4-9B-Chat have shown superior performance beyond Llama-3-8B. In addition to being able to conduct multi-round conversations, GLM-4-9B-Chat also has advanced features such as web browsing, code execution, custom tool calls (Function Call), and long text reasoning (supporting up to 128K context). This generation of models has added multi-language support, supporting 26 languages including Japanese, Korean, and German. We have also launched the GLM-4-9B-Chat-1M model that supports 1M context length (about 2 million Chinese characters) and the multimodal model GLM-4V-9B based on GLM-4-9B. GLM-4V-9B is capable of multi-round bilingual conversations in Chinese and English at a high resolution of 1120 * 1120. In multi-modal evaluations of comprehensive Chinese and English abilities, perceptual reasoning, text recognition, and chart comprehension, GLM-4V-9B has demonstrated superior performance that surpasses GPT-4-turbo-2024-04-09, Gemini 1.0 Pro, Qwen-VL-Max, and Claude 3 Opus.

The model is the chat model of GLM-4 with 9B parameters.

How to run

  1. Clone the container, wait for the container to start, and then just copy the API address in the figure below and open it in the browser to experience the large model.
  1. Click below to enter text prompt
  1. Click Submit, generate the answer