8-bit integer models using the AI Model Efficiency Toolkit
Jan 22, 2021
Qualcomm products mentioned within this post are offered by Qualcomm Technologies, Inc. and/or its subsidiaries.
Making neural network models smaller is crucial for the widespread deployment of AI. Qualcomm AI Research has been developing state-of-the-art quantization techniques that enable power-efficient fixed-point inference while preserving model accuracy, such as Data-Free Quantization (DFQ) and AdaRound, post-training techniques that achieve accurate 8-bit quantization with little or no training data.
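Fixed-point inference of this kind rests on mapping each tensor's floating-point range onto 256 integer levels through a scale and a zero point. The sketch below is a minimal NumPy illustration of the basic asymmetric INT8 scheme; the helper names are hypothetical and this is not AIMET code.

```python
import numpy as np

def calibrate(x):
    """Pick scale and zero point so the observed range [min, max] spans
    the signed 8-bit grid [-128, 127] (asymmetric scheme; real zero is
    exactly representable because the range is widened to include 0)."""
    x_min = min(float(x.min()), 0.0)
    x_max = max(float(x.max()), 0.0)
    scale = (x_max - x_min) / 255.0
    zero_point = int(round(-128 - x_min / scale))
    return scale, zero_point

def quantize(x, scale, zero_point):
    """Map float values onto the signed 8-bit integer grid."""
    q = np.round(x / scale) + zero_point
    return np.clip(q, -128, 127).astype(np.int8)

def dequantize(q, scale, zero_point):
    """Map integers back to approximate float values."""
    return scale * (q.astype(np.float32) - zero_point)

# Round-trip a tensor of weights: the reconstruction error is bounded
# by roughly one grid step (one unit of `scale`).
weights = np.random.default_rng(1).normal(size=1024).astype(np.float32)
scale, zp = calibrate(weights)
restored = dequantize(quantize(weights, scale, zp), scale, zp)
max_err = float(np.abs(weights - restored).max())
```

Post-training methods like DFQ and AdaRound work on top of this scheme: they adjust weight ranges and rounding decisions so that the small per-value error above does not compound into an accuracy loss.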
To make this research more accessible and contribute to the open-source community, Qualcomm Innovation Center (QuIC) launched the AI Model Efficiency Toolkit (AIMET) on GitHub in May 2020. AIMET's goal is to enable power efficient integer inference by providing a simple library plugin for AI developers to utilize for state-of-the-art model efficiency performance. The AIMET project is flourishing with regularly updated quantization techniques based on work from Qualcomm AI Research and active use by the broader AI community, including multiple mobile OEMs, ISVs, and researchers in academia.
QuIC is now taking it a step further by contributing a collection of popular pre-trained models optimized for 8-bit inference to GitHub in the form of the 'AIMET Model Zoo.' Together with the models, AIMET Model Zoo also provides the recipe for quantizing popular 32-bit floating-point (FP32) models to 8-bit integer (INT8) models with little loss in accuracy. The tested and verified recipes include scripts that optimize TensorFlow or PyTorch models across a broad range of categories: image classification, object detection, semantic segmentation, pose estimation, super resolution, and speech recognition.
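The INT8 evaluation step in such a recipe can be approximated with "fake quantization": values stay in floating point but are snapped to the 8-bit grid, so an ordinary float forward pass predicts integer-inference accuracy. The sketch below is a generic illustration of the idea, not the AIMET API; the `fake_quant` helper and the toy layer are hypothetical.

```python
import numpy as np

def fake_quant(x, num_bits=8):
    """Quantize-dequantize: output stays float32, but every value is
    snapped to the nearest point of a symmetric signed 8-bit grid."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = max(float(np.abs(x).max()) / qmax, 1e-8)
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return (q * scale).astype(np.float32)

# Toy "layer": compare the FP32 output with the simulated-INT8 output.
rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)
x = rng.normal(size=4).astype(np.float32)
y_fp32 = w @ x
y_int8 = fake_quant(w) @ fake_quant(x)  # simulated INT8 forward pass
rel_err = float(np.abs(y_fp32 - y_int8).max() / np.abs(y_fp32).max())
```

Running the whole validation set through such a simulated model is what lets a recipe report an INT8 accuracy number before the model ever touches fixed-point hardware.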
This will give researchers and developers direct access to highly accurate quantized models, saving them time in achieving performance benefits such as reduced energy consumption, latency, and memory requirements for on-target inference. For example, imagine you are a developer who wants to do semantic segmentation for image beautification or autonomous driving use cases using the DeepLabv3+ model. AIMET Model Zoo provides a DeepLabv3+ model optimized with the DFQ and Quantization-Aware Training (QAT) features from AIMET. The corresponding AIMET Model Zoo recipe points to this optimized model and provides the calls to the AIMET library needed to run INT8 simulation and assess performance. In fact, the AIMET-quantized version has a Mean Intersection over Union (mIoU) score of 72.08%, virtually equivalent to the 72.32% of the original FP32 model. The image below visually shows how the quantized model in AIMET Model Zoo results in accurate semantic segmentation.
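The mIoU metric quoted above averages per-class intersection-over-union between the predicted and ground-truth label maps. A minimal NumPy version follows; this is a hypothetical helper for illustration, not the evaluation script shipped with the zoo.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean Intersection-over-Union across classes, skipping classes
    absent from both prediction and ground truth."""
    ious = []
    for c in range(num_classes):
        p, t = pred == c, target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            continue  # class not present in this image
        ious.append(np.logical_and(p, t).sum() / union)
    return float(np.mean(ious))

# Tiny 2x3 label maps: class 0 matches exactly, class 1 has one
# extra ground-truth pixel, class 2 has one extra predicted pixel.
pred = np.array([[0, 0, 1], [1, 2, 2]])
target = np.array([[0, 0, 1], [1, 1, 2]])
miou = mean_iou(pred, target, num_classes=3)  # (1 + 2/3 + 1/2) / 3
```

Comparing this score for the FP32 model and its INT8 simulation is how the 72.32% versus 72.08% gap above is measured.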
This is one example. The AIMET Model Zoo has many INT8 quantized neural network models that provide accurate inference comparable to FP32 models. With this initial contribution of 14 INT8 models to AIMET Model Zoo, we are easing the hurdles for the ecosystem in using quantized models in their AI workloads, and thus marching toward making fixed-point, power-efficient inference ubiquitous. You can get the best of both worlds: the high accuracy of a floating-point model and the efficiency of an 8-bit integer model.
Check out our AIMET Model Zoo and AIMET.
Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ('Qualcomm'). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.
Chirag Patel
Engineer, Principal/Mgr., Qualcomm Technologies
Disclaimer
Qualcomm Inc. published this content on 22 January 2021 and is solely responsible for the information contained therein. Distributed by Public, unedited and unaltered, on 23 January 2021 03:03:02 UTC