Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> The more tools like Claude Code are used, the more training they receive as well.

What do you mean? A model doesn't improve because it's being used more. Are you saying Anthropic invests more into Claude Code the more people use it? Or are you saying they collect its output and train it on it?



They probably mean https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu..., but I don't think that is a huge factor for Claude Code.


I assume they mean that they can gather users inputs (e.g. the user correcting the model, suggesting improvements, etc.).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: