NetApp : How to speed up deep learning model training in the automotive sector

June 11, 2021 at 11:05 am EDT

Enabling lane detection at scale with NetApp, Run:AI, and Microsoft Azure

Today's automotive leaders are investing heavily in data-driven software applications to advance the most important innovations in autonomous and connected vehicles, mobility, and manufacturing. These new applications require an orchestration solution and a shared file system for their massive datasets to run distributed training of deep learning models on GPUs. The fascinating process for training AI models in the automotive industry involves many, many images used in a 3D matrix that's formed from 2D color images. These images are analyzed at the pixel and color (RGB) level to detect various objects, such as pedestrians, other cars, and traffic lights.

GPUs need to be maintained at high utilization to reduce training times, permit fast experimentation, and minimize the cost of usage. In addition, a high-performance, easy-to-use file system that prevents GPUs from waiting for data-'GPU starvation'-is imperative in accelerating model training in the cloud and optimizing cost.

Run:AI, Microsoft, and NetApp have teamed together to address a lane-detection use case by building a distributed training deep learning solution at scale that runs in the Azure cloud. This solution enables data scientists to fully embrace the Azure cloud scaling capabilities and cost benefits for automotive use cases.

How we set up our deep learning model training

Here are the tools we used, and how we used them:

Azure NetApp Files provided high-performance, low-latency, scalable storage through NetApp^®Snapshot^™ copies, cloning, and replication.
Azure Kubernetes Service (AKS) simplified deploying and orchestrating a managed Kubernetes cluster in Azure.
Azure compute SKUs with GPUs. These are specialized VMs available with single or multiple GPUs.
Run:AI enabled pooling of GPUs into two logical environments: one for build and one for training workloads. A scheduler manages the compute requests that come from data scientists, enabling elastic scaling from fractions of GPU to multiple GPUs and multiple GPU nodes. The Run:AI platform is built on top of Kubernetes, enabling simple integration with existing IT and data science workflows.
NetApp Trident integrates natively with AKS and its Persistent Volume framework and was used to seamlessly provision and manage volumes from systems running on Azure NetApp Files.
Finally, we did machine learning (ML) versioning by using Azure NetApp Files Snapshot technology combined with Run:AI. This combination perserved data lineage and allowed data scientists and data engineers to collaborate and share data with their colleagues.

What we found

By working with Run:AI, Azure, and NetApp technology, we enabled distributed computations in the cloud, creating a high-performing distributed training system. The system worked with tens of GPUs that communicated simultaneously in a meshlike architecture. And-to optimize cost-we were able to keep them fully occupied at about 95% to 100% utilization.

We were able to saturate GPU utilization and keep the GPU cycles as short as possible. (This is one of the highest-cost components in the architecture.) Azure NetApp Files provides various performance tiers that guarantee sustained throughput at submillisecond latency. We started our distributed training job on a small GPU cluster. Later, we added GPUs to the cluster on demand without interrupting the training-by using the dynamic service level change capabilities of Run:AI software to provide optimal GPU utilization.

Different data science and data engineering teams were able to use the same dataset for different projects. One team was able to work on lane detection, while another team worked on a different object detection task using the same dataset. Researchers and engineers were able to allocate volumes on demand.

We had full visibility of the AI Infrastructure. Using Run:AI's platform, we had full visibility of the AI infrastructure including all pooled GPUs, at the job, project, cluster, and node levels.

Looking to get started?

In this use case, lane detection for autonomous vehicles, we were able to use NetApp, Run:AI and Azure to create a single, unified experience for accelerating model training on the cloud, thus reducing costs while improving training times and simplifying processes for data scientists and engineers. Details are available in this technical report and apply to model training across industries and verticals.

Attachments

Original document
Permalink

Disclaimer

NetApp Inc. published this content on 11 June 2021 and is solely responsible for the information contained therein. Distributed by Public, unedited and unaltered, on 11 June 2021 15:04:00 UTC.

	1st Jan change	Capi.
NETAPP, INC.	+16.12%	21.13B
WESTERN DIGITAL CORPORATION	+34.49%	22.99B
PURE STORAGE, INC.	+46.66%	16.99B
SHANNON SEMICONDUCTOR TECHNOLOGY CO.,LTD.	+10.98%	2.37B
INNODISK CORPORATION	-0.80%	843M
QUANTA STORAGE INC.	+11.22%	782M
NETAC TECHNOLOGY CO., LTD.	-31.63%	662M
ARGOSY RESEARCH INC.	-3.79%	458M
NETLIST, INC.	-31.38%	329M
AUSTRIACARD HOLDINGS AG	+0.95%	246M

1st Jan change

Capi.

NETAPP, INC.

+16.12%

21.13B

WESTERN DIGITAL CORPORATION

+34.49%

22.99B

PURE STORAGE, INC.

+46.66%

16.99B

SHANNON SEMICONDUCTOR TECHNOLOGY CO.,LTD.

+10.98%

2.37B

INNODISK CORPORATION

-0.80%

843M

QUANTA STORAGE INC.

+11.22%

782M

NETAC TECHNOLOGY CO., LTD.

-31.63%

662M

ARGOSY RESEARCH INC.

-3.79%

458M

NETLIST, INC.

-31.38%

329M

AUSTRIACARD HOLDINGS AG

+0.95%

246M

Market Closed - Nasdaq Other stock markets 04:00:00 2024-04-15 pm EDT			5-day change	1st Jan Change
102.4 ^USD	+0.55%		-3.26%	+16.12%

Apr. 09	NetApp, Inc. Announces Executive Changes	CI
Mar. 21	Netapp Insider Sold Shares Worth $862,218, According to a Recent SEC Filing	MT

NetApp, Inc. Announces Executive Changes	Apr. 09	CI
Netapp Insider Sold Shares Worth $862,218, According to a Recent SEC Filing	Mar. 21	MT
NetApp, Inc. Empowers Customers to Securely Talk to Their Data in Collaboration with NVIDIA	Mar. 18	CI
NetApp, Inc. Appoints Alessandra Yockelson as Chief Human Resources Officer	Mar. 18	CI
Netapp Insider Sold Shares Worth $1,535,614, According to a Recent SEC Filing	Mar. 14	MT
BofA Securities Ups Price Target on NetApp to $85 From $78, Keeps Underperform Rating	Mar. 13	MT
Transcript : NetApp, Inc. Presents at Morgan Stanley?s Technology, Media & Telecom Conference 2024, Mar-05-2024 02:50 PM	Mar. 05
Citigroup Raises NetApp's Price Target to $110 From $90, Maintains Neutral Rating	Mar. 05	MT
Tesla, Apple and China’s delusions	Mar. 05
NetApp Turbocharge AI Innovation with Intelligent Data Infrastructure	Mar. 05	CI
NetApp Fights Ransomware in Real-Time with Built-In Artificial Intelligence on Enterprise Storage and Enhanced Cyber-Resiliency Solutions	Mar. 05	CI
North American Morning Briefing : S&P 500 Futures -2-	Mar. 05	DJ
ANALYST RECOMMENDATIONS : Dell, Domino's, Netapp, Okta, M&S...	Mar. 05
Argus Upgrades NetApp to Buy From Hold, Price Target is $130	Mar. 04	MT
Wall Street: record-breaking fireworks on Friday	Mar. 04	CF
Wall Street: record-breaking fireworks, semi-C +4% sector	Mar. 01	CF
S&P 500, Nasdaq Stretch Record Closing Runs	Mar. 01	MT
S&P 500 Climbs to a Fresh Record Close Led by Technology, Real Estate, Consumer Discretionary; Health Care Slips	Mar. 01	MT
S&P 500, Nasdaq Close Record Runs	Mar. 01	MT
US Equity Markets Close Higher Friday Following Manufacturing, Consumer Sentiment Data	Mar. 01	MT
Sector Update: Tech Stocks Sharply Higher Late Afternoon	Mar. 01	MT
Equities Rise Intraday as Investors Weigh Manufacturing Data, Consumer Sentiment	Mar. 01	MT
Weakness in Consumer Sentiment, Manufacturing Sends US Equity Indexes Higher, Treasury Yields Drop	Mar. 01	MT
Sector Update: Tech Stocks Sharply Higher Friday Afternoon	Mar. 01	MT
Sector Update: Tech	Mar. 01	MT

NetApp, Inc.

Equities

NTAP

US64110D1046

Computer Hardware

NetApp : How to speed up deep learning model training in the automotive sector

Latest news about NetApp, Inc.

Chart NetApp, Inc.

Company Profile

Income Statement Evolution

Analysis / Opinion

Ratings for NetApp, Inc.

Analysts' Consensus

EPS Revisions

Quarterly earnings - Rate of surprise

Sector Storage Devices