AMD RDNA 3 2023 enthusiast
AMD Radeon RX 7800 XT
// 16 GB GDDR6 · 263W TDP · 37 TFLOPS FP32
ENTHUSIAST · RANK #4.1
▸ VRAM
16GB
▸ FP32
37TFL
▸ MEM BW
624GB/s
▸ TDP
263W
LLM Inference Performance
| Model | Tokens / sec | Local Fit |
|---|---|---|
| Mistral 7b Q4 | 80 tok/s | fits · single GPU |
| Llama 3 8b Q4 | 75 tok/s | fits · single GPU |
| Llama 3 13b Q4 | 42 tok/s | fits · single GPU |
| Llama 3 70b Q4 | — OOM — | OOM / offload |
Local Model Compatibility
7B params (int) fits
13B params fits
70B (4-bit quant) OOM
Spec Sheet
▸ COMPUTEA0
▸ ARCHITECTURE RDNA 3
▸ GPU CHIP Navi 32
▸ BOOST CLOCK 2430 MHz
▸ FP32 37 TFLOPS
▸ LAUNCH YEAR 2023
▸ MEMORY & RATINGSB0
▸ VRAM 16 GB GDDR6
▸ BANDWIDTH 624 GB/s
▸ TIER enthusiast
▸ OVERALL 4.1/5
▸ AI VALUE 3.6/5
▸ GAMING VALUE 4.4/5
▸ POWERC0
▸ TDP 263 W
▸ PERF/W (FP32) 0.141 TFL/W
▸ MODEL FITD0
▸ RUNS 7B (INT) yes
▸ RUNS 13B yes
▸ RUNS 70B (4-bit) no
▸ PLATFORM ROCm (CUDA unsupported)
Comparable GPUs
Analysis notes
Quick Summary
The RX 7800 XT AI case rests on 16GB of VRAM and strong bandwidth at a midrange price. For local 7B–13B inference it has the capacity and the memory speed; the friction, as ever with AMD for AI, is ROCm tooling rather than the hardware.
Specs That Matter for AI
16GB GDDR6 with 624 GB/s comfortably holds 13B quantized models, and the bandwidth keeps generation reasonably quick. The hardware is not the bottleneck — software maturity is.
Performance
~75 tok/s on Llama 3 8B q4 through ROCm or Vulkan backends. Fine for everyday local inference; expect more setup effort than an equivalent NVIDIA card.
Verdict
A strong value if you want 16GB, primarily game, and dabble in AI with a ROCm-ready toolchain. For frictionless AI, an NVIDIA 12–16GB card is the safer buy.
Frequently Asked Questions
- Is the RX 7800 XT good for AI?
- It has 16GB and solid bandwidth (624 GB/s) at a competitive price, enough for 7B–13B local models. The caveat is ROCm: setup is fiddlier than CUDA and some tools lag, so factor in extra configuration time.
- How fast is the RX 7800 XT for LLMs?
- Around 75 tok/s on Llama 3 8B q4 via ROCm/Vulkan backends — usable and respectable, though NVIDIA cards in this class generally run smoother out of the box.