>We’ve observed that, throughout the course of training, SIMA 2 agents can perfo... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Workaccount2 3 months ago \| parent \| context \| favorite \| on: SIMA 2: An agent that plays, reasons, and learns w... >We’ve observed that, throughout the course of training, SIMA 2 agents can perform increasingly complex and new tasks, bootstrapped by trial-and-error and Gemini-based feedback. >In subsequent training, SIMA 2’s own experience data can then be used to train the next, even more capable version of the agent. We were even able to leverage SIMA 2’s capacity for self-improvement in newly created Genie environments – a major milestone toward training general agents across diverse, generated worlds. Pretty neat, I wonder how that works with Gemini, I suppose SIMA is a model (agent?) that runs on top of it?

FuckButtons 3 months ago [–]

That’s what it sounded like to me, a plain text interface between two distinct systems.

kridsdale1 3 months ago | [–]

That’s what Claude Plays Pokémon is.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact