Outside of prompt processing, the only reason GPU's are better than CPU's for in...

		mistercheph 3 months ago \| parent \| context \| favorite \| on: 25L Portable NV-linked Dual 3090 LLM Rig Outside of prompt processing, the only reason GPU's are better than CPU's for inference is memory bandwidth, the performance of apple M* devices at inference is a consequence of this, not of their UMA.