Industry News

NVIDIA Releases H200, World's Most Powerful AI Chip: Performance Soars 90%, Doubles Llama 2 Inference Speed

Views : 13
Update time : 2023-12-09 10:24:38
        November 13, 2011 - NVIDIA today unveiled the next generation of AI supercomputer chips that will play an important role in deep learning and large-scale language models (LLMs), such as OpenAI's GPT-4. The new chips represent a significant leap forward from the previous generation, and will be used in data centers and supercomputers to handle tasks such as weather and climate prediction, drug discovery, quantum computing, and more. and other tasks.
        The key product release is the HGX H200 GPU, based on NVIDIA's "Hopper" architecture, which is the successor to the H100 GPU and the company's first chip to use HBM3e memory, which is faster and larger, and therefore better suited for large language models. According to NVIDIA, "With HBM3e, the NVIDIA H200 delivers 141GB of memory at 4.8 TB per second, nearly twice the capacity and a 2.4x increase in bandwidth compared to the A100." On the AI side, NVIDIA says the HGX H200 doubles the speed of inference on Llama 2 (70 billion parameter LLM) compared to the H100.The HGX H200 will be available in 4-way and 8-way configurations that are compatible with the software and hardware in the H100 system. It will be available for every type of data center (local, cloud, hybrid cloud and edge) and deployed by Amazon Web Services, Google Cloud, Microsoft Azure and Oracle Cloud Infrastructure, among others, and will be available in the second quarter of 2024.
        Another key product announcement from NVIDIA was the GH200 Grace Hopper "superchip," which combines the HGX H200 GPU with the Arm-based NVIDIA Grace CPU via the company's NVLink-C2C interconnect, and is officially designed for supercomputers. Designed specifically for supercomputers, it allows "scientists and researchers to solve the world's most challenging problems by accelerating complex AI and HPC applications running terabytes of data. The GH200 will be used in "more than 40 AI supercomputers at research centers, system manufacturers and cloud providers around the world," including Dell, Eviden, Hewlett-Packard Enterprise (HPE), Lenovo, QCT, and Supermicro. Notably, HPE's Cray EX2500 supercomputer will use the quad GH200, which scales up to tens of thousands of Grace Hopper superchip nodes. Perhaps the largest Grace Hopper supercomputer will be the GH200. Perhaps the largest Grace Hopper supercomputer is JUPITER at the Jülich facility in Germany, which will be "the world's most powerful AI system" when installed in 2024. It uses a liquid-cooled architecture, and its enhancement module consists of nearly 24,000 NVIDIA GH200 supercomputers interconnected by NVIDIA's Quantum-2 InfiniBand networking platform.
        NVIDIA says JUPITER will contribute to scientific breakthroughs in a number of areas, including climate and weather prediction, generating high-resolution climate and weather simulations with interactive visualizations. It will also be used in drug discovery, quantum computing and industrial engineering, many of which use customized NVIDIA software solutions that simplify development but also make supercomputing teams dependent on NVIDIA hardware. Last quarter, NVIDIA achieved record revenues of $10.32 billion ($13.51 billion total) in AI and data center alone, up 171 percent from a year ago, and NVIDIA is no doubt hoping that the new GPUs and supercomputing chips will help it continue that trend.
 
Related News
Read More >>
How many chips does a car need? How many chips does a car need?
Sep .19.2024
Automotive chips can be divided into four types according to their functions: control (MCU and AI chips), power, sensors, and others (such as memory). The market is monopolized by international giants. The automotive chips people often talk about refer to
Position and Function of Main Automotive Sensors Position and Function of Main Automotive Sensors
Sep .18.2024
The function of the air flow sensor is to convert the amount of air inhaled into the engine into an electrical signal and provide it to the electronic control unit (ECU). It is the main basis for determining the basic fuel injection volume. Vane type: The
Chip: The increasingly intelligent electronic brain Chip: The increasingly intelligent electronic brain
Sep .14.2024
In this era of rapid technological development, we often marvel at how mobile phones can run various application software smoothly, how online classes can be free of lag and achieve zero latency, and how the functions of electronic devices are becoming mo
LDA100 Optocoupler: Outstanding Performance, Wide Applications LDA100 Optocoupler: Outstanding Performance, Wide Applications
Sep .13.2024
In terms of characteristics, LDA100 is outstanding. It offers AC and DC input versions for optional selection, enabling it to work stably in different power supply environments. The small 6-pin DIP package not only saves space but also facilitates install