Hopper — Local AI Glossary | CraftRigs

Hopper is NVIDIA's data-center GPU architecture introduced in 2022, best known for the H100 and H200 accelerators that became the default training silicon for frontier LLMs. For local AI builders, Hopper sits a tier above consumer Ada (RTX 4090) and a tier below the newer Blackwell generation announced at GTC 2026.

Architecture Context

Hopper succeeded Ampere (RTX 3090, A100) and brought fourth-generation tensor cores with native FP8 support, a Transformer Engine for mixed-precision attention, and dramatically wider memory bandwidth via HBM3. It is more efficient per watt than Ampere and pairs with NVLink interconnects designed for multi-GPU training clusters rather than single-rig desktops.

Hopper vs Consumer Cards

You will almost never see a Hopper card in a home rig. H100s ship as SXM modules or PCIe cards priced in the tens of thousands and require server chassis, not ATX cases. Builders running local LLMs lean on Ampere (RTX 3090) for cheap 24GB VRAM, Ada (RTX 4090) for the best consumer perf-per-watt, and watch Blackwell consumer trickle-down rather than chasing Hopper. The exception is used-market H100 PCIe cards appearing in prosumer servers, but power, cooling, and driver complexity make them a poor fit for a desk-side build.

Why It Matters for Local AI

Hopper sets the ceiling that consumer cards are measured against — when a model is "trained on H100s," that tells you the FP8 and HBM3 budget the original team had. For your local rig, Hopper matters mostly as context: it explains why frontier models keep getting larger (cluster bandwidth, not a single VRAM budget), and it anchors the price-performance gap that makes 24GB Ampere cards still the sweet spot for running quantized Llama and Qwen variants at home.