GPU5 categories compared
RTX 5070 vs RTX 4080 for LLMs: New Architecture or More VRAM?
The RTX 5070 brings NVIDIA Blackwell architecture at $549 but keeps 12 GB VRAM. The RTX 4080 is last-gen but offers 16 GB VRAM for around $750 used. For LLM inference, VRAM is often the bottleneck — making this a question of architecture speed vs raw memory capacity.