
Why would it? Do you know how much it costs to finetune one of these models for such a niche language? I'm not just talking about the cost of training, but also the cost of acquiring data because there's much less data about niche languages.


96 A100-hours for a finetune, according to the article.

The cost of dataset curation for a given language is hard to quantify, as there are many unknowns. However, it seems perfectly crowdsourceable to volunteers.


A project like SETI@home should help these efforts I believe?


Maybe the Stable Horde could work it into the project.

https://stablehorde.net/


It certainly costs society much less to train for Pascal once than to make everyone burn CPU cycles running Python!


> Do you know how much it costs to finetune

Between $30 and $3,000, often in the $300 range.


From their numbers: 3 hours on 32 A100 80GB GPUs.

From lambda cloud:

3 hours * 4 nodes * ~$22/hr (per 8x A100 node) ~= $265

So yeah, not too expensive even for a native fine tune (obviously this ignores all costs other than the GPUs).
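
For anyone who wants to plug in their own numbers, here is a rough sketch of that arithmetic. The function name and parameters are made up for illustration, and the $22/hr node price is just the approximate figure quoted above, not a current Lambda quote. It only counts GPU rental, nothing else.

```python
import math


def finetune_gpu_cost(hours, total_gpus, gpus_per_node, node_price_per_hour):
    """Estimate GPU rental cost for a run (hypothetical helper).

    Assumes GPUs are rented as whole nodes and billed per node-hour.
    """
    # Round up: a partially used node is still billed as a full node.
    nodes = math.ceil(total_gpus / gpus_per_node)
    return hours * nodes * node_price_per_hour


# Numbers from the comments above: 3 hours on 32 A100s,
# rented as 8-GPU nodes at an assumed ~$22 per node-hour.
cost = finetune_gpu_cost(hours=3, total_gpus=32, gpus_per_node=8,
                         node_price_per_hour=22.0)
gpu_hours = 3 * 32  # total A100-hours for the run

print(f"{gpu_hours} A100-hours, about ${cost:.0f} in GPU rental")
```

With exactly $22/hr this works out to 4 nodes and $264, which matches the rough estimate above and the 96 A100-hours figure quoted from the article.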


It's wild to me that the raw compute cost is so low.


Sweet Jesus, that is amazing.



