WED, 03 JUN 2026 · 18:32:12 UTC

Nvidia

FlagshipHardware

USA·HQ Santa Clara·Est. 1993

The GPU monopolist that powers virtually every frontier model.

9.0

our score

Our take

The dominant AI compute platform whose GPUs and software stack power virtually every frontier model, though antitrust and custom-silicon risks are rising.

At a glance

Best known for
AI datacenter GPUs and CUDA software ecosystem
Biggest strength
CUDA ecosystem + NVLink/InfiniBand bundling moat
Biggest risk
Hyperscaler custom silicon and China export controls
Stage
Public (NASDAQ:NVDA)
Primary revenue
Datacenter GPUs, AI compute platforms, and networking

What they do

Nvidia designs and sells graphics processing units (GPUs) and complete accelerated computing platforms. Originally built for PC gaming graphics, its datacenter chips—particularly the H100, H200, and the new Blackwell B200—have become the foundational infrastructure for modern artificial intelligence. Every major AI lab, cloud provider, and enterprise building large language models relies on Nvidia silicon for both training and inference. The company does not merely sell discrete chips; it sells entire systems, including DGX servers and HGX baseboards, tightly coupled with its proprietary NVLink interconnects and InfiniBand networking.

Beyond hardware, Nvidia operates as a full-stack computing company. Its CUDA parallel computing platform and software libraries, developed over nearly two decades, create powerful developer lock-in. More recently, it has pushed into software and services with offerings like DGX Cloud—essentially renting its own supercomputers through partner clouds—and Nvidia Inference Microservices (NIM), which package optimized models for enterprise deployment. The company sells primarily to hyperscale cloud providers (Microsoft, Amazon, Google, Meta, Oracle), enterprise IT departments, and scientific computing institutions. While gaming and professional visualization remain revenue lines, datacenter AI now drives the majority of growth and market valuation.

Origin story

Nvidia was founded in 1993 in Santa Clara, California, by Jensen Huang, Chris Malachowsky, and Curtis Priem with a focus on 3D graphics for personal computers. The company went public in 1999 and spent its first decade battling for dominance in the gaming GPU market against rivals like ATI. A pivotal long-term bet came in 2006 with the launch of CUDA, a parallel computing platform that enabled GPUs to run general-purpose software and laid the groundwork for modern AI workloads.

For years, CUDA served primarily scientific and high-performance computing niches. That changed after 2012, when deep neural networks demonstrated transformative results running on Nvidia hardware. The inflection point accelerated dramatically in 2022–2023, when the generative AI boom triggered by large language models created insatiable demand for the company's datacenter chips. What began as a gaming-focused graphics company has since become the dominant infrastructure provider for the AI era, with its market capitalization surging into the trillions as datacenter revenue eclipsed its original gaming business.

Key products

H100

2022

The Hopper-generation datacenter GPU that became the standard engine for training large language models and generative AI workloads.

H200

2023

An upgraded Hopper GPU featuring faster HBM3e memory, designed to improve inference performance for deployed AI models.

Blackwell B200

2024

Next-generation datacenter GPU architecture delivering substantial gains in training and inference efficiency for trillion-parameter models.

DGX Cloud

2023

A cloud service providing access to Nvidia DGX supercomputing infrastructure for AI training through partner cloud providers.

NIM

2024

Nvidia Inference Microservices, a set of optimized containers and APIs for deploying foundation models in enterprise environments.

Leadership

  • JH

    Jensen Huang

    Co-founder, President and CEO

    Driving strategy since 1993; previously at AMD and LSI Logic.

  • CK

    Colette Kress

    EVP and CFO

    Former CFO at Texas Instruments; manages finance since 2013.

Strengths & risks

Strengths

  • +CUDA software ecosystem with 20+ years of developer lock-in
  • +NVLink and InfiniBand creating unmatched system-level bundling
  • +Dominant market share in AI training and inference accelerators
  • +Full-stack vertical integration from chips to DGX to software
  • +Massive R&D scale enabling rapid architecture cadence

Risks

  • Hyperscaler custom silicon eroding long-term market share
  • US-China export controls cutting off major revenue streams
  • Antitrust scrutiny over CUDA bundling and networking dominance
  • Concentrated customer base with few cloud providers dominant
  • Supply chain dependence on TSMC for advanced node wafers

Recent moves

  1. Unveiled Blackwell B200 GPU architecture

    Mar 2024

    Introduced the next-gen Blackwell platform including the B200 GPU, targeting trillion-parameter model training and inference with major efficiency gains.

  2. Launched Nvidia Inference Microservices (NIM)

    Mar 2024

    Released NIM to simplify enterprise deployment of foundation models, shifting further up the software stack beyond raw hardware.

  3. Announced H200 GPU with HBM3e memory

    Nov 2023

    Unveiled the H200 upgrade to extend the Hopper generation's leadership in inference workloads before Blackwell's arrival.

  4. Agreed to acquire Run:ai

    Apr 2024

    Struck a deal to buy the Kubernetes-based AI workload orchestration startup to strengthen its AI enterprise software portfolio.

Competitive position

Nvidia's competitive position is best described as dominant but increasingly contested. In AI training, it faces no credible near-term alternative; AMD's MI300X offers competitive memory bandwidth and capacity, and Intel's Gaudi3 targets cost efficiency, yet neither has replicated the two-decade software moat of CUDA. The combination of optimized compilers, libraries, and developer mindshare means that migrating large AI workloads away from Nvidia remains prohibitively expensive and time-consuming for most organizations.

Where Nvidia is most vulnerable is in inference at massive scale and in hyperscaler-specific deployments. Google, Amazon, and Microsoft are aggressively deploying their own custom silicon—TPU v5p, Trainium2, and Maia 100 respectively—for internal workloads where they control the full stack. In networking, Broadcom and Marvell supply alternative interconnects, though Nvidia's ownership of Mellanox gives it an integrated advantage. Nvidia wins on time-to-market, ecosystem breadth, and system-level integration; it loses on unit cost, power efficiency in specific custom designs, and geopolitical access to the Chinese market.

What to watch

  • 01Blackwell production ramp yields and shipment volumes through 2025
  • 02Hyperscaler capex shifts toward custom silicon vs Nvidia share
  • 03China revenue trajectory and further US export control tightening
  • 04Adoption of open interconnect standards like UALink by clouds
  • 05Regulatory antitrust probes into CUDA bundling and networking

Frequently asked questions

Is Nvidia a hardware or software company?

It is both, but the CUDA software ecosystem and developer tools are arguably its deepest moat, locking customers in far beyond the physical chips.

What makes CUDA so difficult to replace?

Two decades of optimizations, libraries, and developer tooling make migrating large AI workloads to AMD or Intel prohibitively expensive and slow.

Who are Nvidia's largest customers?

Microsoft, Meta, Google, Amazon, and Oracle collectively account for the majority of datacenter GPU revenue through massive cluster purchases.

How do US export controls impact Nvidia?

Restrictions prevent shipment of top-tier chips to China, forcing cut-down alternatives and eliminating a historically significant revenue region.

What is the difference between H100 and Blackwell B200?

B200 is the next-generation architecture succeeding Hopper, offering substantially higher performance and efficiency for both training and inference.

Does AMD or Intel pose a real threat?

AMD's MI300X competes on raw specs and memory, but no rival has yet matched Nvidia's full integrated stack of silicon, software, and networking.

What is DGX Cloud?

It is Nvidia's AI-training-as-a-service offering, providing access to its DGX supercomputers hosted through partner cloud providers.

The bottom line

Nvidia sits at the center of the generative AI build-out, with Blackwell expected to drive the next wave of training clusters and inference deployments. Its integrated stack of GPUs, NVLink, and InfiniBand gives it pricing power and vendor lock-in that competitors struggle to replicate, cementing its role as the essential infrastructure layer for frontier labs.

However, the next three years will test whether this dominance is durable. Hyperscalers are pouring billions into custom silicon to reduce dependency, while AMD's MI300 series offers a viable alternative for inference. Meanwhile, US export controls have walled off China, and antitrust regulators in multiple jurisdictions are examining its bundling practices. The bull case depends on inference demand growing faster than training and Nvidia maintaining its full-stack edge. The bear case sees share erosion to in-house chips and a commoditization of AI accelerators. Investors and buyers should watch Blackwell ramp rates and hyperscaler silicon roadmaps as the leading indicators.

Visit Nvidia

Key products

  • H100
  • H200
  • Blackwell B200
  • DGX Cloud
  • NIM

Founders & leadership

All founders →

In the news

More →

Related companies

All companies →