Making it possible to efficiently analyze video with AI

Qualcomm AI Research's latest research and techniques for efficient video perception

Jan 19, 2021

Qualcomm products mentioned within this post are offered by Qualcomm Technologies, Inc. and/or its subsidiaries.

The saying goes that a picture is worth a thousand words, so what does that imply for video? Video, which is essentially a sequence of static pictures, adds a temporal element and more context. Video perception, the analysis and understanding of video content with AI, can provide valuable insights and capabilities for applications ranging from autonomous driving and smart cameras to smartphones and extended reality. For example, autonomous driving uses video from multiple cameras for a variety of crucial tasks, including pedestrian, lane, and vehicle detection. Video perception is crucial for understanding the world and making devices smarter.

Video is pervasive across applications, devices, and industries.


When talking about AI, people often ask if there's enough data. The answer is a definitive yes. Video data is abundant and being generated at ever-increasing rates. In fact, video is all around us, providing entertainment, enhancing collaboration, and transforming industries. The scale of video being created and consumed is massive - consider that close to 1 million minutes of video crosses the internet per second. However, only a tiny portion of this vast amount of video data, a drop in the ocean, is annotated for supervised learning. This motivates solutions that leverage unsupervised and semi-supervised learning.

Compute efficiency is essential for ubiquitous video perception

So, if video data is readily available and video analysis provides valuable information, why isn't AI being used more often for video perception? In my webinar, I go into a number of data and implementation challenges, but the problem area I want to focus on in this blog post is computational efficiency. As video resolution and frame rates increase and AI video perception models become more complex to improve accuracy, running these workloads in real time is becoming more challenging. Adding to the challenge, we want to run video perception on a diverse set of devices that often have power, thermal, compute, and memory constraints. Processing data closer to the source through on-device AI is important since it offers crucial benefits such as privacy, personalization, and reliability, in addition to helping scale intelligence.

To scale, neural networks for video perception must be compute efficient.


Efficiently running on-device video perception without sacrificing accuracy

At Qualcomm AI Research, our research goal for video perception is to achieve efficient solutions while maintaining and improving neural network model accuracy. Rather than doing brute-force calculations, we try to remove unnecessary computations that do not degrade accuracy. Removing unnecessary calculations generally improves performance, lowers memory consumption, and saves power. The efficient video perception techniques we've developed center on two key concepts: leveraging temporal redundancy and making early decisions.

Leveraging temporal redundancy and making early decisions are critical to efficient video perception


Leveraging temporal redundancy to reduce computations across frames

Leveraging temporal redundancy means taking advantage of the fact that video frames are heavily correlated. The difference between two consecutive frames is often minimal and contains little new information in most regions, so analyzing the entire image is often unnecessary. We want to limit computation to the regions where there are significant changes. Learning to skip regions and recycling features are two novel techniques we've developed to take advantage of temporal redundancy in video.

To learn to skip regions, we developed skip-convolutions for convolutional neural networks (CNNs). We introduce a skip-gate into a convolutional layer so that computation is skipped wherever the difference between the current and previous frame's input features is negligible. The skip-gate itself is a tiny neural network that is trainable and computationally efficient. The net result is that the network learns to skip unnecessary computations while maintaining accuracy. For example, applying our skip-convolution technique to state-of-the-art object detection models resulted in a 3x to 5x speed-up without sacrificing accuracy. What's also noteworthy is that skip convolutions are broadly applicable and can replace the convolutional layers in any CNN for video applications.
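To make the idea concrete, here is a minimal PyTorch-style sketch of a gated convolution that only recomputes outputs where the input changed between frames. It is not Qualcomm's implementation: the class name `SkipConv2d` and the simple threshold gate are assumptions standing in for the small trainable skip-gate described above, and the dense masked compute only illustrates the functional behavior (a real kernel would actually skip the gated-off locations).

```python
import torch
import torch.nn as nn

class SkipConv2d(nn.Module):
    """Illustrative skip-convolution sketch (not the published implementation).

    The convolution output is only updated where the input changed noticeably
    since the previous frame; elsewhere, the cached output is reused.
    """

    def __init__(self, in_ch, out_ch, kernel_size=3, threshold=0.05):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size, padding=kernel_size // 2)
        self.threshold = threshold   # stand-in for a learned gate
        self.prev_input = None       # input features from the previous frame
        self.prev_output = None      # cached output from the previous frame

    def forward(self, x):
        if self.prev_input is None:
            # First frame: run the full convolution and cache the result.
            y = self.conv(x)
        else:
            # Per-location change magnitude between consecutive frames.
            diff = (x - self.prev_input).abs().mean(dim=1, keepdim=True)
            gate = (diff > self.threshold).float()            # 1 = recompute, 0 = skip
            y_new = self.conv(x)                              # dense compute shown for clarity only;
            y = gate * y_new + (1 - gate) * self.prev_output  # efficiency comes from skipping gated-off regions
        self.prev_input, self.prev_output = x.detach(), y.detach()
        return y
```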

The feature recycling technique computes deep features once and reuses them later rather than recomputing them for every frame. The intuition is that deep features remain relatively stationary over time, while shallow features carry the temporally varying information. Feature recycling is applicable to any video neural network architecture, including segmentation, optical flow, classification, and more. In a semantic segmentation example, feature recycling gave us a 78% reduction in computation and a 65% reduction in latency. In addition, we saw a dramatic reduction in memory traffic, which significantly saves power.
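The sketch below shows one simple way to structure feature recycling: an expensive deep backbone runs only on occasional keyframes and its features are cached and reused, while a cheap shallow branch and the task head run on every frame. The module names, the fixed keyframe schedule, and the assumption that both feature maps share the same spatial size are illustrative choices, not the published method.

```python
import torch
import torch.nn as nn

class FeatureRecyclingModel(nn.Module):
    """Illustrative feature-recycling sketch (assumed structure, for explanation only)."""

    def __init__(self, deep_backbone, shallow_branch, head, refresh_every=4):
        super().__init__()
        self.deep_backbone = deep_backbone    # expensive, run sparsely on keyframes
        self.shallow_branch = shallow_branch  # cheap, run on every frame
        self.head = head                      # fuses deep + shallow features
        self.refresh_every = refresh_every
        self.cached_deep = None
        self.frame_idx = 0

    def forward(self, frame):
        # Recompute the slowly varying deep features only every few frames.
        if self.cached_deep is None or self.frame_idx % self.refresh_every == 0:
            self.cached_deep = self.deep_backbone(frame).detach()
        shallow = self.shallow_branch(frame)  # captures the temporally varying detail
        self.frame_idx += 1
        # Assumes both feature maps have matching batch and spatial dimensions.
        return self.head(torch.cat([self.cached_deep, shallow], dim=1))
```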

Making early decisions to reduce computation

Making early decisions means resolving easy inputs early by dynamically changing the network architecture per input frame. Early decisions, in essence, allow us to skip computation that is unnecessary for maintaining accuracy. Early exiting and frame exiting are two techniques that take advantage of making early decisions.

Early exiting exploits the fact that not all inputs require a model of the same complexity to be classified correctly. Complex inputs need very large, compute-intensive models to be classified correctly, but simple inputs can be handled by very small, compact models with high accuracy, failing only on the complex examples. To take advantage of this, we compose the neural network as a cascade of classifiers, and we gate the early-exit decision on temporal agreement and frame complexity. Early exiting reduces compute while maintaining accuracy. In an object classification example, exiting at the earliest possible neural network layer resulted in a 2.5x reduction in computations while maintaining accuracy.
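Here is a minimal inference-time sketch of a cascade with early exits: an auxiliary head after each backbone stage can emit a prediction, and the network stops as soon as that prediction looks reliable. The confidence-threshold exit rule and the single-example assumption are simplifications I've made for illustration; the approach described above gates on temporal agreement and frame complexity instead.

```python
import torch
import torch.nn as nn

class EarlyExitClassifier(nn.Module):
    """Illustrative early-exit cascade (simplified exit rule, single-example inference)."""

    def __init__(self, stages, exit_heads, confidence=0.9):
        super().__init__()
        self.stages = nn.ModuleList(stages)          # backbone chunks, shallow to deep
        self.exit_heads = nn.ModuleList(exit_heads)  # one lightweight classifier per stage
        self.confidence = confidence

    @torch.no_grad()
    def forward(self, x):
        # Assumes batch size 1 so a single scalar decides the exit.
        for stage, head in zip(self.stages, self.exit_heads):
            x = stage(x)
            probs = head(x).softmax(dim=-1)
            top_prob, label = probs.max(dim=-1)
            if top_prob.item() >= self.confidence:
                return label, probs   # easy input: exit early and skip deeper stages
        return label, probs           # hard input: the full network was used
```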

Frame exiting uses a similar gating concept but skips computation on entire input frames by making early decisions. For action recognition tasks, frame exiting not only reduces compute but also improves model accuracy. By adding gates to the neural network architecture, the deeper layers concentrate on the difficult decisions while the earlier layers handle the easy cases. This gating method also lets us train models that trade off accuracy against efficiency, allowing AI developers to customize the model for their use case requirements.
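The following sketch illustrates frame exiting for a video classification task: frames are processed sequentially, a running prediction is maintained, and a lightweight gate decides when that prediction is already good enough, at which point all remaining frames are skipped entirely. The gate, the running-average aggregation, and the single-clip assumption are placeholders of my own, not the published architecture.

```python
import torch
import torch.nn as nn

class FrameExitingRecognizer(nn.Module):
    """Illustrative frame-exiting sketch for clip-level recognition (assumed components)."""

    def __init__(self, backbone, classifier, gate, exit_threshold=0.85):
        super().__init__()
        self.backbone = backbone          # expensive per-frame feature extractor
        self.classifier = classifier      # maps aggregated features to class logits
        self.gate = gate                  # tiny network scoring prediction confidence
        self.exit_threshold = exit_threshold

    @torch.no_grad()
    def forward(self, frames):
        """frames: tensor of shape (T, C, H, W) for a single clip."""
        agg = None
        probs = None
        for t, frame in enumerate(frames):
            feat = self.backbone(frame.unsqueeze(0))
            # Running average of per-frame features as a simple aggregation.
            agg = feat if agg is None else (agg * t + feat) / (t + 1)
            probs = self.classifier(agg).softmax(dim=-1)
            if self.gate(agg).sigmoid().item() >= self.exit_threshold:
                break  # confident enough: skip processing all remaining frames
        return probs
```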

Looking beyond efficient video perception

Looking forward, our future research in video perception will advance the efficiency techniques discussed above while also developing new conditional compute solutions. We are bringing personalized processing, multi-task learning, sparse convolutions, unsupervised and semi-supervised approaches, quantization-aware training, and platform optimizations into our designs. In addition, our perception research is much broader than video. Beyond video, we are driving high-impact machine learning and computer vision research and inventing technology enablers in several areas of perception, from 3D and RF sensing to personalization and biometrics. We are focused on enabling advanced use cases for important applications, including XR, camera, mobile, autonomous driving, IoT, and much more. I look forward to a future with much more perceptive devices that enhance our everyday lives.

Join the webinar

Download the efficient video perception through AI presentation


Qualcomm AI Research is an initiative of Qualcomm Technologies, Inc.


Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ('Qualcomm'). Qualcomm products mentioned within this post are offered by Qualcomm Technologies, Inc. and/or its subsidiaries. The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

Fatih Porikli

Senior Director of Technology, Qualcomm Technologies
