
I'm out of the loop on local models. For my M3 MacBook with 24GB RAM, what token throughput can I expect?

Edit: I tried it out. I have no idea what the token rate was, but it was fluid enough for me. A bit slower than using o3 in the browser, but definitely tolerable. I think I will set it up on my GF's machine so she can stop paying for the full subscription (she's a non-tech professional).
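For anyone who wants a number rather than a feel: Ollama's local HTTP API reports eval_count (generated tokens) and eval_duration (decode time, in nanoseconds) for each request, so throughput is easy to compute. A minimal sketch, assuming Ollama is serving on its default port and that the gpt-oss:20b tag (used here only as an example, substitute whatever you actually pulled) is available:

    # Rough throughput check against a local Ollama server.
    # Assumes the server is running on the default port and the model tag below is pulled.
    import requests

    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": "gpt-oss:20b",  # example tag; swap in the model you downloaded
            "prompt": "Explain the difference between prefill and decode in one paragraph.",
            "stream": False,
        },
        timeout=600,
    )
    data = resp.json()

    tokens = data["eval_count"]            # generated tokens
    seconds = data["eval_duration"] / 1e9  # decode time is reported in nanoseconds
    print(f"{tokens} tokens in {seconds:.1f} s -> {tokens / seconds:.1f} tok/s")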



Apple M4 Pro w/ 48GB running the smaller version. I'm getting 43.7 t/s.


Curious if anyone is running this on an AMD Ryzen AI Max+ 395 and knows the t/s.


3-year-old M1 MacBook Pro with 32GB: 42 tokens/sec in LM Studio.

Very much usable


Wondering the same for my M4 Max with 128GB.


It should fly on your machine


Yeah, it was super quick and easy to set up using Ollama. I had to kill some processes first to avoid memory swap though (even with 128GB of memory), so a slightly more quantized version may be ideal, for me at least.

Edit: I'm talking about the 120B model of course
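A rough way to sanity-check whether a given quant will fit before you hit swap: weight memory is roughly parameter count times bits per weight divided by 8, plus headroom for the KV cache and runtime. A back-of-the-envelope sketch; the parameter counts and the bit widths below are illustrative assumptions, not measurements of any particular build:

    # Back-of-the-envelope check: will the weights fit in unified memory?
    # Parameter counts and bit widths below are rough assumptions for illustration.
    def weight_gb(params_billion: float, bits_per_weight: float) -> float:
        """Approximate weight memory in GB for a given quantization."""
        return params_billion * 1e9 * bits_per_weight / 8 / 1e9

    for name, params_b in [("~20B model", 20), ("~120B model", 120)]:
        for bits in (4, 8, 16):
            gb = weight_gb(params_b, bits)
            print(f"{name} @ {bits}-bit: ~{gb:.0f} GB of weights "
                  f"(plus KV cache and runtime overhead)")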


40 t/s




