Broadcom Inc. announced that it has delivered the StrataXGS® Tomahawk® 5 switch series, providing 51.2 Terabits/sec of Ethernet switching capacity in a single, monolithic device, double the bandwidth of any other switch silicon available on the market. While data centers continue to experience dramatic growth in network bandwidth requirements, there is also strong motivation to unify the networking infrastructure for general-purpose compute and storage with that of AI/ML compute. AI/ML training clusters are driving the need for fabrics with high-bandwidth connectivity, high radix, and lower job completion time, while operating at high network utilization.

Ethernet offers the best solution for unified network infrastructure, providing the lowest power, highest bandwidth, highest radix, and fastest SerDes speeds, along with a predictable doubling of bandwidth every 18 to 24 months. Combined with its large and vibrant ecosystem, these benefits allow Ethernet to provide the highest performance interconnect per Watt and per dollar for AI/ML and cloud-scale infrastructure. To enable the next generation of unified networks, Broadcom is now offering the Tomahawk 5 family.

Critical to enabling efficient use of the massively shared infrastructure in large data centers, Tomahawk 5 provides AI/ML workload virtualization with features such as single-pass VxLAN routing and bridging. Critical to minimizing job completion time (JCT) for AI/ML workloads, Tomahawk 5 offers features such as Broadcom Cognitive Routing, advanced shared packet buffering, programmable inband telemetry, and hardware-based link failover. Tomahawk 5's Cognitive Routing improves network link utilization by automatically and dynamically selecting the most lightly loaded links in the system for every flow that traverses the switch.

This is especially important for AI/ML workloads, which often have a combination of short-lived mice flows and long-lived, high-bandwidth elephant flows with low entropy. Tomahawk 5 includes real-time dynamic load balancing that tracks the utilization of all links, both at the switch and downstream in the network, to determine the optimal path for each flow. It also monitors the health of links in hardware and automatically steers traffic away from failed links.
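
The load-balancing behavior described above can be illustrated with a small software sketch. This is not Broadcom's implementation, which runs in switch hardware and is not detailed here; it is a minimal model under stated assumptions. The names `Link`, `FlowBalancer`, `pick_link`, and `mark_failed` are hypothetical, and scoring a link by the larger of its local and downstream utilization is an illustrative choice, as is keeping a flow on its link until that link fails.

```python
from dataclasses import dataclass


@dataclass
class Link:
    """Hypothetical model of one egress link (not a Broadcom data structure)."""
    name: str
    local_util: float        # utilization at the switch, 0.0 to 1.0
    downstream_util: float   # utilization reported further along the path
    healthy: bool = True


class FlowBalancer:
    """Assigns each flow to the most lightly loaded healthy link."""

    def __init__(self, links):
        self.links = {link.name: link for link in links}
        self.flow_table = {}  # flow id -> name of the link it is assigned to

    @staticmethod
    def _load(link):
        # Assumption: a path is only as good as its busiest segment.
        return max(link.local_util, link.downstream_util)

    def pick_link(self, flow_id):
        # Keep an existing assignment while its link is still healthy.
        current = self.flow_table.get(flow_id)
        if current is not None and self.links[current].healthy:
            return current

        candidates = [l for l in self.links.values() if l.healthy]
        if not candidates:
            raise RuntimeError("no healthy links available")

        best = min(candidates, key=self._load)
        self.flow_table[flow_id] = best.name
        return best.name

    def mark_failed(self, name):
        # Models link failover: later lookups steer flows off this link.
        self.links[name].healthy = False
```

In this sketch, a low-entropy elephant flow that a static hash would pin to a busy link is instead placed on whichever healthy link currently has the lowest score, and calling `mark_failed` on that link causes the next `pick_link` call for the same flow to move it elsewhere.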

These features provide dramatically improved network utilization and reduced congestion, resulting in shortened JCT.