Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Outside of prompt processing, the only reason GPU's are better than CPU's for inference is memory bandwidth, the performance of apple M* devices at inference is a consequence of this, not of their UMA.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: