AMD launches new DPUs to boost AI efficiency and network performance in data centres
AMD has launched new a range of data processing units (DPUs) to reduce strain on networks for data centre operators.
Unveiled at the company’s Advancing AI event in San Francisco, the Pensando Salina 400 and Pensando Pollara 400 are designed to enhance AI workload efficiency by improving networking routing to avoid traffic congestion.
AMD’s Salina 400 is designed for front-end networks. Featuring 16 N1 Arm cores, the DPU is designed for hyperscalers, enabling them to support intelligent load balances to utilise the complete bandwidth while minimising network congestion.
The new Salina 400 DPU optimises back-end networks, enabling performance to remain efficient during intense workloads such as AI training.
The new
Polara 400 networking adapter, meanwhile, is designed for back-end networks.
the Polara 400 NIC is the first-ever adapter designed to support the UEC standard for AI and high-performance computing data centre interconnects. Developed by the Ultra Ethernet Consortium, it’s seen as an alternative to InfiniBand, an interconnect standard largely used by hardware rival Nvidia.
Sitting at the heart of both of AMD’s new networking solutions is its
P4 engine, a compact fully programmable unit designed to optimise network workloads.
The P4 is capable of supporting 400 gigabytes per second (Gb/s) line rate throughput while multiple services run concurrently on the device.
The devices are essentially designed to ensure AI workloads in data centres powered by vast arrays of GPUs operate at peak efficiency. The hardware can effectively manage network congestion to avoid performance degradation by re-routing workloads to avoid network failures and quickly recover from occasional packet loss.
https://www.capacitymedia.com/article/amd-launches-new-dpus
Soni Jiandani, SVP and general manager of the network technology solutions group at AMD said in a press briefing that the Ethernet-based standard can scale to millions of nodes, compared to the foundational architecture of InfiniBand which is not poised to scale beyond 48,000 nodes “without making dramatic and highly complex workarounds.”
The Polara 400 is also programmable, enabling it to support further UEC-developed standards from release.
“Selena 400 and the Polara 400 are solving the challenges for both front-end and back-end networks, including faster data ingestion, secure access, intelligent load balancing, congestion management and fast failover, and loss recovery,” Jiandani said.