HyperAIHyperAI

Deploy Quantized Models on CUDA

This article introduces how to use TVM automatic quantization (a quantization method of TVM).

Deploy Quantized Models on CUDA | Tutorials | HyperAI