Nvidia has unveiled Grace, an Arm-based data centre CPU designed for giant-scale artificial intelligence (AI) and high-performance computing (HPC) applications.
This new processor combines Arm CPU cores with a low-power memory subsystem to help it analyse enormous datasets requiring both ultra-fast compute performance and massive memory.
Nvidia Grace, named after US programming pioneer Grace Hopper, is a highly specialised processor that will target workloads such as training next-generation NLP models that have over 1 trillion parameters, according to the company.
Furthermore, Nvidia claims that a Grace CPU-based system will deliver 10x faster performance than the current Nvidia DGX-based systems that run on x86 CPUs. Nvidia expects this new processor to service a niche segment of computing.
“Leading-edge AI and data science are pushing today’s computer architecture beyond its limits – processing unthinkable amounts of data,” said Jensen Huang, founder and CEO of Nvidia.
“Using licensed Arm IP, Nvidia has designed Grace as a CPU specifically for giant-scale AI and HPC. Coupled with the GPU and DPU, Grace gives us the third foundational technology for computing, and the ability to re-architect the data centre to advance AI. Nvidia is now a three-chip company.”
The Swiss National Supercomputing Centre (CSCS) and the US Department of Energy’s Los Alamos National Laboratory have already announced plans to build Grace-powered supercomputers.
This move could spell trouble for other chipmakers who already have a strong presence in the data centre market like Intel, which is currently dominating with a 90% share, and AMD. By promising a 10x increase in processing performance, it may cause some customers to take note of Grace. This was reflected in the markets where Intel and AMD are both down several percentage points following Nvidia’s announcement.
Nvidia also announced eight Nvidia Ampere architecture GPUs for next-generation laptops, desktops and servers.
The new Nvidia RTX A5000 and Nvidia RTX A4000 GPUs will help speed AI, graphics and real-time rendering up to 2x faster than previous generations in desktops. In laptops, the new Nvidia RTX A2000, Nvidia RTX A3000, RTX A4000 and RTX A5000 GPUs deliver accelerated performance without compromising mobility.
For data centres, the new Nvidia A10 GPU provides up to 2.5x the virtual workstation performance of the previous generation while the A16 GPU provides up to 2x user density with lower total cost of ownership and an enhanced virtual desktop infrastructure experience over the previous generation.
In February, it emerged that Nvidia had turned to some of its older graphic cards to meet the demand for GPUs during a global shortage of PC components and chipsets. Nvidia was going to re-release its old chips, such as the GTX 1050 Ti chip, which was meant to have been phased out two years ago, as well as the GeForce RTX 2060.