Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>We’ve observed that, throughout the course of training, SIMA 2 agents can perform increasingly complex and new tasks, bootstrapped by trial-and-error and Gemini-based feedback.

>In subsequent training, SIMA 2’s own experience data can then be used to train the next, even more capable version of the agent. We were even able to leverage SIMA 2’s capacity for self-improvement in newly created Genie environments – a major milestone toward training general agents across diverse, generated worlds.

Pretty neat, I wonder how that works with Gemini, I suppose SIMA is a model (agent?) that runs on top of it?



That’s what it sounded like to me, a plain text interface between two distinct systems.


That’s what Claude Plays Pokémon is.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: