At NVIDIA GTC 2021, Inspur Information introduced three new GPU servers (NF5468M6, NF5468A5 and NF5280M6), which fully support the latest NVIDIA A30, NVIDIA A10 and NVIDIA A100 Tensor Core GPUs to meet the demand for AI computing power in multiple computing scenarios, including running virtualized infrastructure for the NVIDIA AI Enterprise software suite. Inspur AI servers are among the first in the industry to fully support NVIDIA Ampere architecture-based GPUs and has obtained NVIDIA-Certified System status for these servers, which support the NVIDIA EGX platform for next-generation AI. Delivering AI Solutions for Industry: Currently, an Inspur server powered with A100 GPUs is in use at Northwestern University Feinberg School of Medicine, where the institute is pilot testing high-performance data pipelines to enable deep learning experiments without having to deal with separate, costly copies of legacy health system enterprise data. With the performance of Inspur’s A100-based training platform, the pilot program has provided significant performance improvements not just in model training but in overall project delivery. With a 10x improvement in training speed, and 100x improvement in data prep, Northwestern Medicine can rapidly prototype, iterate, and ultimately deploy deep learning models directly into the healthcare environment. Furthermore, Inspur servers supporting the NVIDIA Ampere architecture are also widely used in deep learning, image recognition, natural language understanding, intelligent recommendation and other intelligent scenarios in various industries, helping enterprise users accelerate AI innovation. Inspur’s All-New GPU Servers Supporting A30, A10 and A100: NF5468M6: ultra-flexible for AI workloads, supports 2x Intel 3rd Gen Intel® Xeon Scalable processor and 8x NVIDIA A100/A40/A30 GPUs, 16x NVIDIA A10 GPUs, or 20x NVIDIA T4 GPUs; supports up to 12x 3.5-inch hard drives for large local storage in a 4U chassis; flexibly adapts to latest AI accelerators and smartNICs and has the unique function of switching topologies with one click for various AI applications including AI cloud, IVA(Intelligent Video Analysis), video processing, etc. NF5468A5: versatile AI server featuring 2x AMD Rome/Milan CPUs and 8x NVIDIA A100/A40/A30 GPUs; N+N redundancy design enables 8x 350W AI accelerators in full-speed operations for superior reliability; the CPU-to-GPU non-blocking design allows interconnection without the PCIe switch communication, achieving faster commutation efficiency. NF5280M6: purpose-built for all scenarios, with 2x Intel 3rd Gen intel® Xeon Scalable processor and 4x NVIDIA A100/A40/A30/A10 GPUs or 8x NVIDIA T4 Tensor Core GPUs in 2U chassis, capable of long-term stable operation at 45°C. The NF5280M6 is equipped with the latest PFR/SGX technology and trusted security module design, which is suitable for demanding AI applications. Also, Inspur announced the brand-new Inspur M6 AI servers fully support NVIDIA BlueField- 2 DPUs. Moving forward, Inspur plans to integrate NVIDIA BlueField- 2 DPUs into its next-generation AI servers, which will enable faster and more efficient management of users and clusters as well as interconnected data access, for scenarios like AI, big data analysis, cloud computing, and virtualization. Inspur is the world's leading AI server vendor with a rich array of AI computing products and works closely with AI customers to help achieve high order-of-magnitude performance improvements for AI applications in speech, semantics, image, video, search, and more.