gpu//db
AMD RDNA 3 2023 enthusiast

AMD Radeon RX 7800 XT

// 16 GB GDDR6 · 263W TDP · 37 TFLOPS FP32
▸ AI VALUE
3.6/5
ENTHUSIAST · RANK #4.1
▸ VRAM
16GB
▸ FP32
37TFL
▸ MEM BW
624GB/s
▸ TDP
263W

LLM Inference Performance

Model Tokens / sec Local Fit
Mistral 7b Q4
80 tok/s
fits · single GPU
Llama 3 8b Q4
75 tok/s
fits · single GPU
Llama 3 13b Q4
42 tok/s
fits · single GPU
Llama 3 70b Q4
— OOM — OOM / offload

Local Model Compatibility

7B params (int) fits
13B params fits
70B (4-bit quant) OOM

Spec Sheet

▸ COMPUTEA0
▸ ARCHITECTURE RDNA 3
▸ GPU CHIP Navi 32
▸ BOOST CLOCK 2430 MHz
▸ FP32 37 TFLOPS
▸ LAUNCH YEAR 2023
▸ MEMORY & RATINGSB0
▸ VRAM 16 GB GDDR6
▸ BANDWIDTH 624 GB/s
▸ TIER enthusiast
▸ OVERALL 4.1/5
▸ AI VALUE 3.6/5
▸ GAMING VALUE 4.4/5
▸ POWERC0
▸ TDP 263 W
▸ PERF/W (FP32) 0.141 TFL/W
▸ MODEL FITD0
▸ RUNS 7B (INT) yes
▸ RUNS 13B yes
▸ RUNS 70B (4-bit) no
▸ PLATFORM ROCm (CUDA unsupported)

Comparable GPUs

Analysis notes

Quick Summary

The RX 7800 XT AI case rests on 16GB of VRAM and strong bandwidth at a midrange price. For local 7B–13B inference it has the capacity and the memory speed; the friction, as ever with AMD for AI, is ROCm tooling rather than the hardware.

Specs That Matter for AI

16GB GDDR6 with 624 GB/s comfortably holds 13B quantized models, and the bandwidth keeps generation reasonably quick. The hardware is not the bottleneck — software maturity is.

Performance

~75 tok/s on Llama 3 8B q4 through ROCm or Vulkan backends. Fine for everyday local inference; expect more setup effort than an equivalent NVIDIA card.

Verdict

A strong value if you want 16GB, primarily game, and dabble in AI with a ROCm-ready toolchain. For frictionless AI, an NVIDIA 12–16GB card is the safer buy.

Frequently Asked Questions

Is the RX 7800 XT good for AI?
It has 16GB and solid bandwidth (624 GB/s) at a competitive price, enough for 7B–13B local models. The caveat is ROCm: setup is fiddlier than CUDA and some tools lag, so factor in extra configuration time.
How fast is the RX 7800 XT for LLMs?
Around 75 tok/s on Llama 3 8B q4 via ROCm/Vulkan backends — usable and respectable, though NVIDIA cards in this class generally run smoother out of the box.

Sources