Fully Managed AI Platform: Access to NVIDIA’s high-performance AI infrastructure without the need for on-premises hardware.
Advanced GPU Clusters: Each instance includes 8 NVIDIA H100 or A100 80GB Tensor Core GPUs, totaling 640GB of GPU memory, suitable for training large-scale AI models.
High-Speed Networking: Utilizes NVIDIA NVLink and NVSwitch technologies for high-bandwidth, low-latency interconnects between GPUs, enhancing multi-node training performance.
Optimized Storage Solutions: Equipped with high-performance NVMe storage to support demanding AI workloads.
DGX Cloud Create: A Kubernetes-based platform for orchestrating AI workloads, facilitating efficient training and fine-tuning of models.
Serverless Inference: Deploy AI models with automatic scaling and efficient GPU utilization, eliminating the need for managing underlying infrastructure.
Benchmarking Tools: Provides templates and guidelines for evaluating model performance, supporting scalability up to 2,048 GPUs.
Multi-Cloud Support: Available through major cloud providers like Oracle Cloud Infrastructure, Microsoft Azure, Google Cloud, and Amazon Web Services, offering flexibility and global scalability.
Expert Support: Direct collaboration with NVIDIA engineers to optimize model performance and deployment strategies.
Predictable Pricing: Transparent monthly pricing starting at $36,999 per instance, inclusive of hardware, software, storage, and 24/7 support.
NVIDIA DGX Cloud: A Unified Cloud Platform for AI Development and Deployment
NVIDIA DGX Cloud is a comprehensive cloud platform developed by NVIDIA, designed to enable organizations to access advanced computational power for AI training and inference without the need for complex on-premises infrastructure.
NVIDIA DGX Cloud is integrated with leading cloud service providers including Oracle Cloud Infrastructure (OCI), Microsoft Azure, Google Cloud, and Amazon Web Services (AWS), offering global-scale access to high-performance computing resources.
Each DGX Cloud instance, equipped with 8 GPUs and 640 GB of GPU memory, starts at $36,999 per month.
This pricing includes hardware, software, storage, and 24/7 expert support.
NVIDIA DGX Cloud provides a robust end-to-end solution for organizations aiming to accelerate and scale their AI initiatives.
With state-of-the-art infrastructure, cutting-edge software platforms, and specialized support, DGX Cloud empowers enterprises to fast-track AI innovation across industries.
Category |
Details |
GPU Nodes |
8× NVIDIA A100 80 GB or H100 80 GB Tensor Core GPUs (640 GB total) |
Memory & Storage |
10 TB storage per instance; scalable egress/bandwidth (10 TB/month baseline) |
Network Fabric |
High-speed, low-latency interconnect for multi-node scaling |
Software Platform |
NVIDIA Base Command Platform, AI Enterprise, NIM APIs, NeMo Curator, serverless inference, benchmarking |
Support & Services |
24/7 expert support, technical account & customer success managers, single-point contact |
Pricing |
Predictable monthly rate including hardware, software, storage, egress, support |
Hybrid & Multi‑Cloud |
Deployable across public clouds and on-premise via unified Base Command interface |
Fully Managed Multi‑Node AI Platform
High-performance GPU clusters (8× A100/H100, 640 GB total GPU memory) delivered as a service with turnkey deployment .
NVIDIA‑Optimized Software Stack
Powered by NVIDIA Base Command Platform and AI Enterprise software—includes NIM microservices, NeMo Curator, serverless inference, and benchmarking workflows .
Serverless Inference with Autoscaling
Scales down to zero during inactivity, reducing costs and enabling flexible deployment via API/CLI/UI .
Cloud‑Agnostic Hybrid Integration
Available on multiple cloud partners with unified management across cloud and on-premises environments .
Expert Support & Predictable Pricing
Includes 24/7 support, dedicated technical account manager, and transparent monthly pricing covering compute, storage, egress, software, and consulting .
DGX Cloud Lepton Marketplace Access
Enables on‑demand access to GPU capacity from a global cloud-provider network, with real-time health insights and region-based workload sovereignty .
Discover the countless ways that Q9 technology can solve your network challenges and transform your business – with a free 30-minute discovery call.
At Q9, we have the skills, the experience, and the passion to help you achieve your business goals and transform your organization.
All rights reserved for Q9 technologies.