Everyone's watching Jensen Huang's keynote today, waiting to hear about chips with 288GB of HBM4 memory that cost more per rack than a house. None of that is for you.
That's not cynicism. It's the most useful thing someone can tell a home lab builder right now: the biggest GPU conference of the year will produce zero hardware you can actually buy, and the real opportunity is what happens to the used market in the 10 days that follow.
What Jensen Actually Announced
Vera Rubin is real, and it's genuinely impressive. The R100 GPU packs 288GB of HBM4 memory running at 22 TB/s bandwidth — 22 terabytes per second, a number that kind of breaks your brain if you sit with it. 336 billion transistors on TSMC's 3nm process. 50 petaflops of FP4 inference per GPU, which Nvidia claims is 5x faster than Blackwell. The Vera CPU it pairs with has 88 Arm cores and connects at 1.8 TB/s via NVLink-C2C, eliminating the PCIe bottleneck that plagues everything you've ever put in a personal rig.
Note
Vera Rubin R100 specs: 288GB HBM4 | 22 TB/s memory bandwidth | 50 PFLOPS FP4 | 336B transistors | TSMC 3nm | Ships H2 2026 | Datacenter-only, NVL72/NVL144 rack form factor
In an NVL72 rack configuration — 72 of these GPUs working together — you get 3.6 exaflops of FP4 inference and roughly 20.7 TB of total HBM. Single-rack inference on a 1.2 trillion parameter model, no multi-node sprawl. Nvidia says the cost-per-token drops 10x compared to Blackwell. Hyperscalers are excited for obvious reasons.
It's a hell of a machine. It's also sold exclusively to hyperscalers and enterprise AI factories. No consumer version is coming. Not a "Lite" variant, not a workstation SKU that lands 18 months later. Rubin is a datacenter play, full stop.
Feynman got maybe four minutes of stage time. Jensen teased a 1.6nm TSMC A16 chip targeting 2028, briefly mentioned silicon photonics integration and the Intel foundry partnership announced in January, and moved on. Real in the sense that Nvidia has committed to the roadmap publicly. Completely irrelevant in the sense that 2028 is two full GPU generations away and we have no consumer-facing specs to evaluate.
For the full architectural breakdown of what Vera Rubin means versus current hardware, see the Vera Rubin vs Hopper deep-dive.
Why None of This Matters to Your Build
People get confused about this every year. They watch the keynote, see the mind-bending numbers, and then sit on their hands. Waiting for trickle-down. It doesn't come. Not from chips like these.
Vera Rubin's HBM4 is not a memory architecture that translates to gaming cards or prosumer PCIe hardware. HBM is expensive and physically enormous — it's why the H100 cost $30,000 and lived on a server board. The Rubin GPU die is two reticle-sized chunks of silicon. The thing is massive. Your ATX case is not in the picture.
The RTX 5090, announced at CES in January 2025 and still barely available at MSRP in March 2026, is what Blackwell looks like when Nvidia eventually decides to sell it to regular people. That took about 18 months from Blackwell's enterprise launch. Whatever "Rubin for consumers" ends up being — assuming it happens at all — you're looking at 2028, right around when Feynman is getting its first real spec sheet.
Caution
Don't wait. "I'll buy when the new stuff trickles down" has cost home lab builders years of productive compute. The RTX 4090 is three years old and still handles 70B parameter models at INT4 quantization comfortably. The opportunity cost of waiting is real.
How GTC Historically Moves Used GPU Prices
This is the part nobody talks about loudly enough. GTC creates a buying window.
Not because anyone reads the Vera Rubin specs and concludes that it directly obsoletes their 4090. It happens because hype cycles are contagious. When 30,000 people watch Jensen announce the future is here, a meaningful subset of people who already own capable hardware get itchy to sell. They upgrade in their heads before their wallets catch up. They list cards.
The historical pattern is consistent. When Nvidia announced the RTX 3000 series in September 2020, the used market got flooded with RTX 2080 Ti cards within days. Sellers who'd been holding out dumped them — prices that had sat stubbornly above $600 started falling toward $300. The same thing happened in late 2022 when the 4000 series arrived: GPU prices on eBay fell across the board as people rushed to list 3000-series cards before the new generation made them feel stale.
GTC isn't a product launch for consumers. But it functions like one psychologically. The announcement creates a mental inflection point where sellers decide "now is the time." And crucially, it also shakes loose cards that have been sitting in people's desks while they waited for the "right moment."
The window typically runs 7 to 14 days after the keynote.
The RTX 4090 Used Market Right Now
Before GTC week, the 4090 used market looked like this:
- r/hardwareswap: $1,200–$1,500 for verified sellers, cards used for gaming or light AI work
- eBay: ~$2,200 average, with stale listings sitting at $2,500+ that aren't moving
- Jawa and similar platforms: $2,200 for "like new" condition
The $1,200 end of hardwareswap is genuinely excellent. That's a card that runs Llama 3 70B at INT4 quantization at 52 tokens per second — fast enough for real interactive use. 24GB of GDDR6X handles everything up to 70B models without breaking a sweat.
For comparison: the RTX 5090 runs the same model at 85 tokens per second. Impressive. But it costs $2,200+ even on a good day, draws 575W (which forces a PSU upgrade in most existing builds), and the median eBay buyer is still paying $3,775 due to scalper premiums. The RTX 5090 at $3,775 is not a better home lab investment than a 4090 at $1,387. The math isn't close. For a direct comparison of those two cards, the RTX 5090 vs 4090 breakdown has the numbers.
Tip
Set a saved search on r/hardwareswap for "4090" sorted by new. Filter for sellers with at least 5 confirmed trades. The deals move fast — have your PayPal ready and don't sleep on a card from a verified seller asking $1,300.
The Post-GTC Buy Window: When to Watch
Today is March 16. The keynote is happening or just wrapped.
The buy window opens in roughly 48 hours and runs through March 28.
What you're watching for: a small surge in listings across r/hardwareswap, eBay, and Facebook Marketplace. Cards from people who watched the keynote and felt the upgrade itch, or who've been sitting on hardware for months and decide that current GPU attention is a good moment to sell. It's not a dramatic price collapse — more like a 10-15% softening at the lower end of the range, and more inventory to negotiate on.
The mistake is buying during the keynote hype itself, in the first 24 hours. That's when prices are sticky and sellers are feeling confident. The actual softening happens after the news cycle moves on — roughly day 5 through day 12 post-keynote.
Also worth watching: the RTX 3090. These 24GB cards have gotten overlooked as 4090 prices stayed elevated. A clean 3090 at $430-$490 runs 13B-parameter models without issues and handles quantized 30B models with room to spare. Not the same card as a 4090 — but for a first local LLM rig, it's a legitimate entry point that the current hype cycle tends to undervalue. For the full breakdown on the 3090 versus AMD's current alternative, see the RTX 3090 vs RX 9070 XT comparison.
What Home Builders Should Actually Buy Right Now
Three honest tiers:
Under $700: RTX 3090. Prioritize cards from workstation or lightly-used gaming setups over ones that ran mining or 24/7 compute. Target $430-$520.
$1,000–$1,600: The 4090 on r/hardwareswap is the clear answer. 24GB VRAM covers every practical local inference workload you're likely to run. This is probably the last time the card is at these prices — once the post-GTC listing surge clears, hardwareswap inventory historically tightens back up.
Over $2,000: The RTX 5090 makes sense, but only near MSRP ($1,999). If you can't find one close to that, two used 4090s in NVLink gives you 48GB of combined VRAM for total model capacity — more than the 5090's 32GB — at roughly the same cost if you're patient about it. That's not a clean solution given NVLink limitations, but it's the honest comparison.
The one thing definitively not worth doing: waiting for Vera Rubin or Feynman derivatives to appear as consumer cards. By the time any Rubin architecture reaches a PCIe slot, your current hardware will have paid for itself in avoided API costs many times over. And whatever is on Jensen's roadmap after Feynman will be teased by then anyway.
Buy the 4090. Buy it between March 18 and March 28. Check hardwareswap first.
See Also
- Vera Rubin vs Hopper: What NVIDIA's GTC 2026 Announcement Means for Local AI Builders — the 30–50x performance gap explained
- RTX 3090 vs RX 9070 XT in 2026: The AMD Card That Changes the Equation — the sub-$900 GPU decision
- RTX 5090 vs RX 9070 XT for Local LLM: The Real Numbers — if you're considering the high end
- GPU Price Alert: MSI Is Warning of 15-30% Hikes — why the used market window is closing and what to buy before April
- The RTX 3090 Is Now the Best Value Local LLM GPU (March 2026 Price Guide) — current market pricing and where to find clean cards