
Until now, we only know what they claim the costs are.


Inference costs can't be faked though, since the model can be run locally by anyone with capable hardware.

Even if the whole story about the training cost was fake, R1 and the distilled models are still very efficient at inference.
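For a sense of what "capable hardware" means, here is a rough back-of-envelope VRAM estimate for a distilled 7B model under 4-bit quantization (a sketch only; it ignores KV cache and runtime overhead, and the quantization level is an assumption):

```python
# Rough VRAM estimate for running a distilled 7B model locally.
# Assumes 4-bit quantization; ignores KV cache and runtime overhead.
params = 7_000_000_000
bytes_per_param = 0.5      # 4 bits = 0.5 bytes per parameter
vram_gb = params * bytes_per_param / 1e9
print(f"{vram_gb:.1f} GB")  # → 3.5 GB
```

At roughly 3.5 GB of weights, such a model fits on common consumer GPUs, which is what makes independent verification of inference efficiency practical.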


The shock for the industry was the claimed training costs and used hardware.
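The headline figure can be reproduced from the numbers DeepSeek themselves published in the V3 technical report (the GPU-hour count and rental rate below are their claims, not independently verified):

```python
# Back-of-envelope check of the claimed training cost, using figures
# from the DeepSeek-V3 technical report (the lab's own claims).
gpu_hours = 2_788_000      # claimed total H800 GPU-hours
rate_per_hour = 2.0        # assumed rental price, USD per GPU-hour
cost = gpu_hours * rate_per_hour
print(f"${cost:,.0f}")     # → $5,576,000
```

That arithmetic is where the widely quoted "~$5.6M" figure comes from; the open question is whether the underlying GPU-hour count is accurate.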


Unless OAI already has all the optimization tricks, which they probably do.


Is the model architecture actually that different from anything else? Or are you just saying that you can get away with smaller models now?


What DS did is in line with my expectations, as I see a lot of performance optimizations possible at the algorithmic level. So even if the DS numbers are BS, somebody else will soon reach and even beat them.



