Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Why is Gemini Flash so much cheaper than other models here?


probably a mix of economies of scale (google workspace and search are already massive customers of these models meaning the build out is already there), and some efficiency dividends from hardware r&d (google has developed the model and the TPU hardware purpose built to run it almost in parallel)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: