The NVIDIA HGX platform stands at the forefront of accelerated computing, meticulously engineered to meet the rigorous demands of artificial intelligence (AI) and high-performance computing (HPC) workloads. By integrating cutting-edge GPUs, high-speed interconnects, and advanced networking technologies, HGX delivers unparalleled performance, scalability, and efficiency for data centers worldwide.
At the heart of the HGX platform are NVIDIA’s latest GPUs, including the Blackwell Ultra and Hopper series. These GPUs offer significant enhancements in AI inference and training capabilities, enabling faster processing and improved energy efficiency.
HGX leverages NVIDIA’s NVLink and NVSwitch technologies to facilitate rapid communication between GPUs. This architecture ensures efficient parallel processing, critical for complex AI and HPC tasks.
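The efficiency of that GPU-to-GPU fabric is easiest to see in the collective operations it carries. Below is a pure-Python sketch of a ring all-reduce, the pattern NCCL commonly uses for gradient synchronization over NVLink; the function name, worker count, and vector sizes are illustrative, and real implementations exchange chunks in parallel rather than serially.

```python
# Pure-Python sketch of a ring all-reduce, the collective pattern NCCL
# typically runs over NVLink/NVSwitch fabrics. Illustrative only: real GPUs
# exchange chunks in parallel, while this loop is serial.

def ring_allreduce(worker_data):
    """Sum equal-length vectors across n workers; every worker gets the total."""
    n = len(worker_data)
    chunk = len(worker_data[0]) // n           # each worker "owns" one chunk
    data = [list(v) for v in worker_data]      # copy so inputs stay intact

    # Phase 1: reduce-scatter. After n-1 steps, worker i holds the fully
    # summed chunk (i + 1) % n.
    for step in range(n - 1):
        for i in range(n):
            src = (i - step) % n               # chunk worker i forwards now
            nxt = (i + 1) % n                  # ring neighbour
            for j in range(src * chunk, (src + 1) * chunk):
                data[nxt][j] += data[i][j]

    # Phase 2: all-gather. Finished chunks circulate around the ring until
    # every worker has every chunk.
    for step in range(n - 1):
        for i in range(n):
            src = (i + 1 - step) % n
            nxt = (i + 1) % n
            for j in range(src * chunk, (src + 1) * chunk):
                data[nxt][j] = data[i][j]

    return data

result = ring_allreduce([[float(k + 1)] * 8 for k in range(4)])
```

Each worker moves roughly 2(n-1)/n times the buffer size over the ring, which is why per-link NVLink bandwidth directly bounds all-reduce time.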
The platform supports various configurations, such as 4-GPU and 8-GPU baseboards, allowing organizations to scale their infrastructure based on specific workload requirements.
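As a rough illustration of how baseboard choice follows from workload size, here is a minimal capacity-planning sketch. The helper name and the assumption that a 4-GPU board carries half the HBM of an 8-GPU board are hypothetical; the 2.3 TB figure is the aggregate capacity this article quotes for a Blackwell system.

```python
import math

# Hypothetical capacity-planning sketch: how many HGX baseboards are needed
# to hold a model's memory footprint. The 2.3 TB aggregate-HBM figure comes
# from this article's Blackwell numbers; the 4-GPU value simply assumes half
# the HBM of an 8-GPU board.
HBM_8GPU_BOARD_TB = 2.3
HBM_4GPU_BOARD_TB = HBM_8GPU_BOARD_TB / 2

def boards_needed(footprint_tb, board_capacity_tb):
    """Smallest number of baseboards whose combined HBM holds the workload."""
    return math.ceil(footprint_tb / board_capacity_tb)

# A 4 TB working set needs two 8-GPU Blackwell boards, or four 4-GPU boards.
eight_gpu = boards_needed(4.0, HBM_8GPU_BOARD_TB)   # 2
four_gpu = boards_needed(4.0, HBM_4GPU_BOARD_TB)    # 4
```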
With support for NVIDIA Quantum InfiniBand and Spectrum-X Ethernet, HGX provides high-bandwidth, low-latency networking solutions, essential for large-scale AI deployments.
HGX is complemented by NVIDIA’s robust software stack, including TensorRT and Magnum IO, which streamline AI model training and inference workflows, ensuring optimal performance.
The HGX B300 NVL16 configuration delivers up to 11 times higher inference performance compared to previous generations, facilitating real-time processing of large language models like Llama 3.1 405B.
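A quick back-of-envelope check shows why a 405B-parameter model can be served from a single HGX system. The byte-per-parameter figure assumes FP8 weights; all numbers are illustrative rather than vendor measurements.

```python
# Back-of-envelope memory check (illustrative, not vendor data): at FP8,
# one parameter costs one byte, so Llama 3.1 405B's weights occupy ~405 GB,
# comfortably inside a 2.3 TB HGX system.
params = 405e9
bytes_per_param = 1.0                    # FP8 weights (assumed)
weight_bytes = params * bytes_per_param  # ~405 GB

hgx_hbm_bytes = 2.3e12                   # 2.3 TB aggregate HBM (from the text)
fits_on_one_system = weight_bytes < hgx_hbm_bytes
headroom_tb = (hgx_hbm_bytes - weight_bytes) / 1e12   # ~1.9 TB left over
```

The remaining ~1.9 TB is what makes real-time serving practical: it holds the KV cache and activations for long contexts and large batches.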
Equipped with the second-generation Transformer Engine and FP8 precision support, HGX platforms achieve up to 4 times faster training speeds for large-scale AI models.
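To put training speed in perspective, the widely used rule of thumb of roughly 6 x parameters x tokens total FLOPs gives a rough wall-clock estimate. The token count, cluster size, and sustained-utilization fraction below are assumptions for illustration, not measured values.

```python
# Rough training-time estimate using the common ~6 * params * tokens FLOPs
# rule of thumb. Token count, utilization, and cluster size are assumptions.
params = 405e9          # model size (Llama 3.1 405B class)
tokens = 15e12          # training tokens (assumed)
total_flops = 6 * params * tokens                # ~3.6e25 FLOPs

peak_flops = 32e15      # 32 PFLOPS FP8 per 8-GPU H100 node (from the text)
utilization = 0.4       # assumed sustained fraction of peak
nodes = 512             # assumed cluster size

seconds = total_flops / (peak_flops * utilization * nodes)
days = seconds / 86400  # roughly two months under these assumptions
```

A generation that trains 4 times faster at the same utilization would cut such a run from months to weeks, which is the practical meaning of the headline multiplier.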
HGX systems offer substantial memory capacities (up to 2.3 TB) and high GPU-to-GPU bandwidth (up to 1.8 TB/s), accommodating complex AI and HPC workloads.
The NVIDIA HGX platform is designed not just for raw computational power, but also for robust, high-performance networking and security infrastructure, enabling scalable and secure AI and HPC deployments in modern data centers.
At the core of the HGX networking stack are NVIDIA BlueField-3 Data Processing Units (DPUs). These programmable, high-performance DPUs take over data movement, security, and storage management, tasks traditionally performed by the host CPUs.
Offloading these tasks to the DPU not only enhances performance and frees CPU resources, but also enforces stronger security boundaries between workloads.
HGX platforms also support NVIDIA Spectrum-X, the world’s first Ethernet networking solution built specifically for AI workloads. Traditional Ethernet switches are not optimized for the unique traffic patterns of AI training, such as synchronized GPU communication and massive data throughput. Spectrum-X addresses these patterns with adaptive routing and congestion control tuned for AI traffic.
With Spectrum-X, data centers can scale out AI infrastructure over Ethernet while maintaining performance similar to InfiniBand.
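A toy simulation makes the difference concrete: with static ECMP hashing, a few long-lived "elephant" flows can collide onto one link, while adaptive routing of the kind Spectrum-X performs keeps links evenly loaded. The flow IDs, link count, and stand-in hash below are all hypothetical.

```python
# Toy model of ECMP hash collisions vs. adaptive routing. AI training
# produces a few long-lived "elephant" flows; a static hash can land many
# of them on one uplink, while adaptive routing steers each flow toward
# the least-loaded link. All IDs are made up.
LINKS = 4
flows = [0, 4, 8, 12, 16, 20, 24, 28]   # adversarial flow IDs

# Static ECMP: link = flow_id mod LINKS (stand-in for a 5-tuple hash).
# Every one of these IDs maps to link 0.
static_load = [0] * LINKS
for f in flows:
    static_load[f % LINKS] += 1

# Adaptive routing: place each flow on the currently least-loaded link.
adaptive_load = [0] * LINKS
for _ in flows:
    adaptive_load[adaptive_load.index(min(adaptive_load))] += 1
```

Under static hashing all eight flows pile onto one link; adaptive placement caps every link at two, which is why hash-based Ethernet alone underperforms on AI collectives.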
For applications demanding the highest possible performance, HGX also supports NVIDIA Quantum InfiniBand, delivering ultra-low latency, high bandwidth, and in-network computing with SHARP to accelerate collective operations.
InfiniBand is especially suited to supercomputing and massively parallel AI training, where every microsecond counts.
HGX platforms are ideal for training and deploying large-scale generative AI models, offering the computational power required for complex tasks.
HGX supports HPC applications, including scientific simulations and data analytics, by providing the necessary computational resources and scalability.
With its modular design and advanced networking capabilities, HGX is well-suited for integration into cloud environments, facilitating AI-as-a-Service offerings.
In summary, the NVIDIA HGX platform represents a significant advancement in AI and HPC infrastructure, delivering exceptional performance, scalability, and efficiency to meet the evolving demands of modern data centers.
Multi‑GPU SXM Architecture
Supports configurations with 4× or 8× SXM GPUs per baseboard, spanning A100, H100, H200, and Blackwell-generation B200, enabling flexible deployment scenarios.
High-Performance Tensor Cores
Delivers up to 144 PFLOPS of FP4 Tensor Core performance per system (Blackwell HGX B300) or 32 PFLOPS of FP8 (8-GPU HGX H100).
Fifth‑Generation NVLink/NVSwitch Interconnect
Enables GPU-to-GPU bandwidth of 1.8 TB/s and total NVLink bandwidth of 14.4 TB/s in advanced Blackwell B300 configurations.
Unified Memory & Massive Bandwidth
Offers up to 2.3 TB HBM3e memory (Blackwell) or 640 GB HBM3 (H100), with aggregate memory bandwidth up to 24 TB/s.
Scalable Networking & DPU Integration
Integrates high-speed InfiniBand (Quantum/NDR) and Spectrum-X Ethernet; includes optional BlueField DPUs for network offload, security, and composable storage.
Enterprise-Level Software Stack
Powered by NVIDIA Magnum IO, CUDA-X, and NGC catalog, and includes AI/HPC stack enhancements (e.g., Transformer Engines, MIG, SHARP, confidential computing).
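The aggregate memory-bandwidth figure above translates directly into a decoding roofline for LLM inference: generating one token requires streaming the full weight set from memory at least once, so HBM bandwidth caps single-stream token throughput. A minimal sketch, assuming FP8 weights and ignoring KV-cache traffic and batching:

```python
# Roofline for single-stream LLM decoding (illustrative): each new token
# streams the full weight set from HBM, so token rate is bounded by
# aggregate memory bandwidth divided by weight bytes. KV-cache reads and
# batching are ignored for simplicity.
weight_bytes = 405e9        # Llama 3.1 405B at FP8, 1 byte/param (assumed)
aggregate_bw = 24e12        # 24 TB/s aggregate HBM bandwidth (from the text)

max_tokens_per_sec = aggregate_bw / weight_bytes   # ~59 tokens/s ceiling
```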
© Q9 Technologies. All rights reserved.