Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You've got to do piecemeal validation steps yourself, especially for models like Sonnet 3.7 that tend to over-generate code and bury themselves in complexity. Windsurf seems to be onto something. Running Sonnet 3.7 in thinking mode will sometimes reveal bits and pieces about the prompts they're injecting when it mentions "ephemeral messages" reminding it about what files it recently visited. That's all external scaffolding and context built around the model to keep it on track.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: