NVIDIA and Nebius Group N.V. partner to develop and deploy the next generation of hyperscale cloud for the AI market, helping meet rapidly growing global demand for high-performance compute.

“Nebius has been built for AI since day one — not adapted from a general-purpose cloud, but designed for what developers actually need,” said Arkady Volozh, CEO of Nebius. “Now with NVIDIA, we are extending that throughout the stack — from gigawatt-scale AI factories to inference and software — as we build one of the first and largest clouds for all AI builders everywhere.”
Scaling full-stack AI Cloud
The partnership entails both companies collaborating on:
- AI factory design and support: Including access to partner design material, design review processes and acceptance, early samples and system software support, bring-up support, and regular system partner business and technical reviews.
- Inference: Creating a best-in-class inference and agentic AI stack for developers and enterprises with NVIDIA’s latest software technologies, optimised models and libraries.
- AI infrastructure deployment: Deploying multiple generations of NVIDIA infrastructure across Nebius’s platform through early adoption of NVIDIA computing architectures, including the NVIDIA Rubin platform, NVIDIA Vera CPUs and NVIDIA BlueField® storage systems.
- Fleet management: Optimising Nebius’s holistic fleet health by deploying NVIDIA’s latest GPU health monitoring and software recommendations.

AI is at another inflection point — agentic AI, driving incredible compute demand and accelerating infrastructure buildout,” said Jensen Huang, founder and CEO of NVIDIA. “Nebius is building an AI cloud designed for the agentic era, fully integrated from silicon to software and powered by NVIDIA’s next-generation accelerated compute. Together, we are scaling the cloud to meet the surging global demand for intelligence.”
