Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What I think they’re claiming is that it’s a base model aimed for further fine tuning, that when further tuned might perform better than GPT-4 on certain tasks.

It’s an argument they make at least as much to market fine tuning as their own model.

This is not a generic model that outperforms another generic model (GPT-4).

That can of course have useful applications because the resource/cost is then comparatively minuscule for certain business use cases.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: