Multi agent collaboration is quite likely the future. All agents have blind spot...

NitpickLawyer · 2025-12-27T18:57:16 1766861836

> Multi agent collaboration is quite likely the future

AutoGen from MS was an early attempt at this, and it was fun to play with, but it came too early (the models themselves kinda crapped out after a few convos). This would work much better today given how long agents can stay on track.

There was also a finding earlier this year, I believe from the swe-bench guys (or hf?), where they saw better scores when alternating between gpt5/sonnet4 after each call during an execution flow. The scores from alternating between them were higher than either model scored individually. Found that interesting at the time.
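For anyone wondering what "alternating after each call" could look like in practice, here is a minimal sketch. It is not the benchmark authors' code: `run_alternating`, `model_a`, `model_b`, and the "DONE" stop marker are all hypothetical names, and the two models are stand-in functions rather than real API clients.

```python
from itertools import cycle
from typing import Callable, List

# A "model" here is any callable that reads the history and returns a reply.
# Real clients would wrap an API call; these are hypothetical stand-ins.
Model = Callable[[List[str]], str]


def run_alternating(models: List[Model], task: str, max_steps: int = 6) -> List[str]:
    """Run one agent loop, handing each step to the next model in turn."""
    history = [task]
    picker = cycle(models)  # round-robin over the models, one per call
    for _ in range(max_steps):
        model = next(picker)
        reply = model(history)  # every model sees the full shared history
        history.append(reply)
        if reply.endswith("DONE"):  # assumed stop marker, purely illustrative
            break
    return history


def model_a(history: List[str]) -> str:
    return f"A:step{len(history)}"


def model_b(history: List[str]) -> str:
    suffix = " DONE" if len(history) >= 4 else ""
    return f"B:step{len(history)}{suffix}"


if __name__ == "__main__":
    for line in run_alternating([model_a, model_b], "fix the failing test"):
        print(line)
```

The key point is that both models share a single history, so each one picks up where the other left off instead of running a separate attempt.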

paulirish · 2025-12-28T04:46:20 1766897180

The latter, if anyone else is curious: https://www.swebench.com/post-250820-mini-roulette.html

bahaAbunojaim · 2025-12-27T14:47:48 1766846868

Thank you so much for sharing, Denis! I definitely believe in that, as the world starts switching from single agents to agentic teams where each agent has specific capabilities. Do you know of any benchmarks that cover collaborative agents?

DenisM · 2025-12-28T06:16:46 1766902606

You’re welcome.

I don’t know of any benchmarks, sorry.