CraftRigs
Technical Report

GPU Supply Is Tightening Now — Here's What to Buy Before Prices Jump

By Charlotte Stewart 7 min read

Some links on this page may be affiliate links. We disclose it because you deserve to know, not because it changes anything. Every recommendation here comes from benchmarks, not budgets.

Japan and Germany Are Rationing High-End GPUs Right Now

You might've missed it, but GPU supply just tightened globally in late 2025. Not a government order — retail shops in Japan and Germany started rationing high-VRAM cards because they couldn't keep up with demand.

Here's what's actually happening:

Japanese retailers started voluntarily capping GPU sales in late 2025. Cards with 16GB or more VRAM — the RTX 5070 Ti, RTX 5080, RTX 5090, and most of the H-series — became restricted. The reason? GDDR7 memory production bottleneck. Samsung and Micron can't produce enough GDDR7 chips to feed both gaming and AI demand simultaneously.

A major German distributor announced allocation limits in early 2026: maximum 5 units per SKU (5070 Ti, 5080, 5090) to any single customer. That trickles down to retail partners, which means smaller shops run out faster.

This isn't hoarding or speculation. It's real supply constraint hitting the exact GPUs that local AI builders want most.

Why These Specific Cards?

The RTX 5070 Ti and 5080 are the sweet spot for running 70B models locally. The 5070 Ti fits $750 (MSRP) into most builders' budgets; the 5080 gives you 20% more VRAM for better quantization flexibility. Enterprise AI data centers want the same cards — H100s and L40S for their own workloads — but when upstream shortage hits, they cannibalize consumer supply by outbidding retail shops.

Budget GPUs (RTX 5050, RTX 5060 Ti) aren't interesting to data centers. They're still in stock everywhere because nobody's competing for them.

Current US Inventory — What You Can Actually Buy Today

As of early April 2026, US retail availability is already tightening:

  • RTX 5070 Ti: Constrained stock. ASUS reportedly halted production. Real-world prices are $880-$1,069, not the $749 MSRP. 3-5 day delivery at limited retailers; many are out of stock.
  • RTX 5080: Limited stock. Prices $1,150-$1,249 (MSRP was $999). Most big retailers sold out or accepting pre-orders with no ship date.
  • RTX 5090: Pre-order only. 4+ week lead times reported.
  • RTX 5060 Ti: Healthy availability. $399-$449 across Best Buy, Newegg, Amazon. Next-day shipping in most metros.
  • RTX 5050: Excellent stock. $199-$249, readily available.
  • Used RTX 4080 Super: Stable secondary market. Prices holding at $650-$750.

The gap between MSRP and reality is already 15-40%. That gap widens when shortages deepen.

The Timeline — When Does the US Really Feel This?

Here's what we know from past shortage cycles:

2020-2021: GPU shortage that lasted 18+ months. Prices peaked at 2-3x MSRP for high-end cards. Took until late 2022 for inventory to normalize.

Late 2022-2023: H100 shortage in enterprise. Retailers had 12+ week backlog for H100s by end of 2023. Prices jumped 15-20% as enterprise demand outpaced supply.

Japan and Germany retail rationing started ramping up in late 2025. Historically, when upstream inventory tightens in Asia, US retail feels it within 6-8 weeks. That puts us in the danger zone right now — April to June 2026.

What's different this time? The shortage is AI-driven and structural, not temporary. Data center demand for GPUs is growing 30% year-over-year. It's not speculative buying like crypto in 2021 — it's real workloads. Retail supply can't compete with enterprise purchasing power.

Warning

If you're planning to buy a high-end GPU (RTX 5070 Ti, 5080, 5090), the next 2-4 weeks are your window. After that, lead times stretch and prices rise. This isn't fear-mongering — it's supply math.

Which GPUs to Buy This Week vs. Which Can Wait

Buy immediately (this week):

  • RTX 5070 Ti ($880-$1,069) — end-of-life, inventory will evaporate
  • RTX 5080 ($1,150-$1,249) — limited stock, prices rising weekly
  • RTX 5090 ($1,999+) — pre-order only, fastest to shortage

Wait 2-3 weeks (still safe):

  • RTX 5070 ($649-$699) — lower demand, more stable
  • RTX 4080 Super ($850-$950 new, $650-$750 used) — last-gen supply is solid
  • RTX 5060 Ti 8GB ($399-$449) — abundant, no urgency

Safe to wait (no shortage pressure):

  • RTX 5050 ($199-$249) — enterprise ignores these
  • RTX 5060 Ti ($399-$449) — mid-range, stable supply
  • Used RTX 4070 Super / 4080 / 4090 — secondary market hasn't tightened yet

Price Expectations — What Happens When Shortage Hits

We can't predict the exact jump, but historical patterns matter:

  • 2021 shortage: Some GPUs hit 200-300% of MSRP before normalizing
  • 2023 H100 shortage: Enterprise demand created 15-25% premiums over list
  • Average constraint period: 6-12 weeks before upstream production catches up

If the RTX 5070 Ti is $880 today (already 17% over MSRP), and rationing hits harder in May-June, don't be shocked to see it push toward $1,050-$1,150 by summer. That's an extra $170-$270 you'd pay later.

Paying today's inflated price locks you in. Waiting gambles on prices staying flat or dropping — unlikely during a shortage.

Strategy by Build Timeline

Building this month (April 2026)? Buy your GPU this week. Don't wait.

Building in May-June? Grab your RTX 5070 Ti or 5080 right now, but you can wait on power supply, case, RAM, and CPU. Secondary parts aren't constrained.

Building in July or later? You're taking a risk. Shortage windows are unpredictable. Secure your GPU by April 30. You can always hold it a few weeks before assembly.

On a tight budget ($500-$1,000)? RTX 5060 Ti is your safest move. It has zero shortage pressure and gives you 8GB VRAM for any model up to 14B. Prices stable, no FOMO required.

Multi-GPU builder (dual RTX 5080 or 5090 + 5080)? Buy both cards this week if you can. NVLink support is confirmed for local inference. Scarcity will hit the 5080 and 5090 simultaneously — don't count on buying the second card in May.

Where to Buy (and What to Avoid)

In stock right now:

  • Best Buy (RTX 5070 Ti limited stock, 3-5 day delivery; price match available)
  • Newegg (RTX 5070 Ti $880+; frequently out of stock)
  • Amazon (RTX 5060 Ti 8GB readily available next-day in most metros)
  • B&H Photo (pro pricing, 1-day NYC delivery)
  • Microcenter (in-store pickup same day if you're local)

Tracking tools:

  • gputracker.com — alerts for Best Buy, Newegg, Amazon drops
  • Tom's Hardware GPU price tracker — historical pricing and current deals
  • Camelcamelcamel — Amazon price history and delivery tracking

Avoid:

  • Pre-orders beyond 30 days (too much cancellation risk during shortages)
  • Marketplace sellers charging 2-3x MSRP (they're scalpers, not real deals)
  • Vague "ships within 60 days" listings (lead times will stretch)

Decision Tree — Buy Now or Wait?

Are you building in April-May? → Buy this week.

Are you building in June-July? → Buy your GPU by April 30. Prices will be higher by then.

Are you building in August+? → Buy by April 30 anyway. Shortages are unpredictable; don't gamble on "things normalizing" later.

Budget capped at $500-$800? → RTX 5060 Ti is safe, can wait 3 weeks.

Multi-GPU setup? → All cards this week.

Can't decide between 5070 Ti and 5080? → Buy the 5070 Ti. It's cheaper, end-of-life (supply scarcer), and still handles every local LLM up to 70B at Q4 quantization. The 5080 is nice but only worth the extra $300-$400 if you're regularly running 70B+ models.

CraftRigs Take

This shortage is different from 2021 because it's permanent, not cyclical. Crypto miners eventually abandoned GPU mining. Data centers won't abandon AI. Enterprise demand for GPUs is growing 30% annually and will keep growing. Retail builders are losing the supply war — not by accident, but by market forces.

You're not being irrational if you buy an expensive GPU today. You're pricing in the cost of losing access to it later. The math is simple: RTX 5070 Ti at $880 now beats RTX 5070 Ti at $1,100-$1,200 in June.

This isn't FOMO. It's just constraints catching up to demand.

FAQ

Why is GDDR7 specifically the bottleneck?

GDDR7 is new memory tech (launched 2024-2025). Only Samsung and Micron make it, and they're ramping production but can't keep up with simultaneous demand from gaming GPUs (RTX 50-series), AI GPUs (H100, H200), and enterprise data centers. It's a classic supply bottleneck — one component behind, everything queues up.

Should I buy used (RTX 4080 Super, RTX 4090) instead of new?

Used secondary market lags 2-3 weeks behind new shortages. If new stock evaporates in May, used prices will rise too, but you have a fallback. Good fallback strategy: buy used 4080 Super ($650-$750) this week if new cards sell out before you can grab one.

Is buying internationally (from UK/EU shops) a workaround?

No. Shipping ($50-$150) plus import duty plus 2-3 week delivery erodes any savings. By the time it arrives, you've missed the US shortage window anyway or paid more overall. Not worth it.

Will AMD Radeon RX 7000 series fill the gap?

AMD inventory is stable (enterprise hasn't switched to AMD at scale yet). But expect AMD prices to rise 5-10% when NVIDIA supply evaporates — demand will shift. RDNA 4 (next-gen AMD) doesn't launch until mid-2026.

What about last-gen RTX 4000-series?

Stable supply through 2026. RTX 4070 Super ($449-$499), RTX 4080 Super ($650-$750), RTX 4090 used ($800-$950). These handle any local LLM under 70B. Consider them if 5000-series availability fails you.

If I order today and receive in May, will my GPU be obsolete by then?

No. GPU cycles are 12-18 months; RTX 50-series (launched January 2025) won't have a successor until late 2026 or 2027. Buying an RTX 5070 Ti in May 2026 is still a good card for 2027.

Can I return a GPU if prices drop later?

Best Buy has 15-day no-questions returns; Newegg 30-day. Use that window if the shortage prediction misses and prices fall. But historically, they don't fall during shortage periods — they stabilize once production catches up.

What if I miss the window and prices jump 20%?

You'll have longer lead times and pay more, but you won't be blocked entirely. Last resort: buy a used 4080 Super for $700-$750, use it for 3-4 months, flip it when new stock is back in supply. Net cost: ~$100-$150 in depreciation rather than $250+ price markup on new cards.


gpu-shortage rtx-5070-ti rtx-5080 local-llm-hardware gpu-buying-guide

Technical Intelligence, Weekly.

Access our longitudinal study of hardware performance and architectural optimization benchmarks.