> To build DeepSeek probably requires at least a $1B+ budget. Between their alleged 50,000 H100 GPUs, expensive (and talented) staff, and the sheer cost of iterating across numerous training runs - it all adds up to far, far more than their highly dubious claim of $5.5M.
This is not fair. Is OpenAI, for example, including the CEO paycheck for the model training costs?
There's a sliding scale. On one end is "Include the CEO's paycheck"; on the other is "include nothing except the price tag on the final, successful training run".
Neither end is terribly useful. Unfortunately, the $5.5M number is for the latter.
This is not fair. Is OpenAI, for example, including the CEO paycheck for the model training costs?