Hacker News | new | past | comments | ask | show | jobs | submit | awestroke's comments

This is becoming a bit scary. I almost hope we'll reach some kind of plateau for llm intelligence soon.

A plateau is unlikely, at least for cybersecurity. RL scales well here and is replicable outside of Anthropic (rewards are verifiable, so setting up the training environment doesn't require that much cleverness).
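The "verifiable rewards" point can be made concrete: in a capture-the-flag-style training environment the reward is a mechanical check, not a human judgment, which is what makes the setup easy to replicate. A minimal sketch (the CTF framing and flag format are illustrative assumptions, not anything from Anthropic's post):

```python
# Sketch of a verifiable reward for an RL cybersecurity environment:
# the episode is "solved" iff the agent's transcript contains the
# secret flag, so scoring requires no human grader.

def verifiable_reward(agent_output: str, flag: str) -> float:
    """Return 1.0 if the agent recovered the flag, else 0.0."""
    return 1.0 if flag in agent_output else 0.0
```

Because the check is binary and automatic, millions of rollouts can be scored cheaply, which is exactly why RL scales well in this domain.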

The post also points out that the model wasn't trained specifically on cybersecurity, and that it was just a side-effect – so I think there's still a lot of headroom.

It's scary, but there's also some room for cautious non-pessimism. More people than ever can cause billions of dollars of damage in attacks now [1], but the same tools can be used for defense. For that reason, I'm more optimistic about mitigations in security than in other risk areas like biosecurity.

[1]: https://www.noahlebovic.com/testing-an-autonomous-hacker/


On a topic like cybersecurity, we never win by not looking: one needs top-of-the-line knowledge of how to break a system to be able to protect it. We already face that dilemma with human experts: the same government-sponsored unit that tells you to update your encryption can hold on to that information and exploit it at their leisure.

Given that it's absolutely impossible to stop people not aligned with us (for any definition of us) from doing AI research, the most reasonable way forward is to dedicate compute resources to the frontier, and to automatically send reasonable disclosures to major projects. It could in itself be a pretty reasonable product. Just like you pay for dubious security scans and publish that you are making them, an LLM company could offer actually expensive security reviews with a preview model, and charge accordingly.


The immediate plateau is the energy output of the Sun captured by the Dyson Swarm around it. Until there it's smooth sailing.

We need to promote alignment and other ethics benchmarks; we can't change what we don't measure. I don't even know any off the top of my head.

If we don't innovate, someone else will. This is the very nature of being a human being. We summit mountains, regardless of the danger or challenge.

>If we don't innovate, someone else will.

Terrible take. You don't get to push the extinction button just because you think China will beat you to the punch.

>This is the very nature of being a human being. We summit mountains, regardless of the danger or challenge.

No, just no... We barely survived the Cold War, at times through pure luck. AI is at least as dangerous as that, if not more. Our capabilities have far exceeded our wisdom. As you have so cleanly demonstrated.


You assume there is the option of not pushing the extinction button. Nobody asked chimps if they wanted humans around. These processes are outside our control.

I predict they will release it as soon as Opus 4.6 is no longer in the lead. They can't afford to fall behind. And they won't be able to make a model that is intelligent in every way except cybersecurity, because that would also degrade general coding and SWE ability.

Alternatively they'll just dial it down a bit so it beats a competitor but isn't unsafe.

The rewrite is excellent

Just have some autonomous killbot drones patrol the perimeter

It's shit, but most people don't know better

Which "claw" do you recommend?

None of them, but prefer ones written with engineering rigor and security in mind. Having an unvetted plugin ecosystem with code that runs unsandboxed is laughably naive
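The "unsandboxed plugin" complaint can be illustrated with a sketch: instead of importing untrusted plugin code into the host process, run it in a child process with POSIX resource limits. (All names are hypothetical, and real sandboxing would also need filesystem and network isolation; this only shows the process-isolation idea.)

```python
# Sketch: run an untrusted plugin script in a separate process with
# CPU-time and memory caps, rather than exec'ing it in-process.
import resource
import subprocess
import sys

def run_plugin(path: str, timeout: float = 5.0) -> subprocess.CompletedProcess:
    def limit() -> None:
        # Applied in the child just before the plugin starts:
        # cap CPU seconds and address space.
        resource.setrlimit(resource.RLIMIT_CPU, (2, 2))
        resource.setrlimit(resource.RLIMIT_AS, (256 * 2**20, 256 * 2**20))

    return subprocess.run(
        [sys.executable, path],
        preexec_fn=limit,       # POSIX-only
        capture_output=True,
        timeout=timeout,
    )
```

A runaway or malicious plugin then gets killed by the kernel instead of taking the host process with it.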

The one attached to your arm.

If you can open an elevated connection to your production db from your terminal, you're already toast
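The underlying principle is least privilege: interactive shells should never hold standing write-capable production credentials. A toy client-side guard makes the point (the host list, DSN format, and "readonly" role name are all hypothetical; real enforcement belongs in server-side roles):

```python
# Sketch: refuse to build a write-capable DSN for an interactive
# production session. Illustrative only.
PROD_HOSTS = {"db.prod.internal"}  # hypothetical production host

def checked_dsn(host: str, role: str, interactive: bool) -> str:
    """Build a connection string, rejecting interactive elevated prod access."""
    if host in PROD_HOSTS and interactive and role != "readonly":
        raise PermissionError("interactive prod access must use the readonly role")
    return f"postgresql://{role}@{host}/app"
```

If elevated access is ever needed, it should come from a short-lived, audited grant rather than credentials sitting in a terminal.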

Perhaps when they switch over fully to Azure they'll forget to disable IPv6 access. One can dream


Not all of us enjoy being glazed mercilessly while getting subpar output


I have burnt billions of tokens on GPT 5.4 and I don't know what you are talking about


It's trash for larger codebases vs Opus unfortunately.


Quite the contrary in my experience: xhigh is the only model + thinking level that can reliably locate the bug


Why should anybody avoid bun? Just fork it if it ever changes license. In fact, I'm 100% sure it would be instaforked if Anthropic ever tried anything


Why should they pay money for such crappy software?


This whole thread is people repeating wrong facts that have been clarified 100x in the previous threads on the same issue.

I wonder why conversation can never progress. When a stake goes in the ground, it never ever comes out.

FWIW OpenAI didn't buy OpenClaw.


"now at OpenAI" were my original words - they did the equivalent of an acqui-hire and "protected" OpenClaw in a foundation.

In the context of Anthropic's seemingly aggressive machinations, your hair-splitting, without clarifying anything beyond "OpenAI didn't buy OpenClaw", seems itself misleading and rather counter to helping conversations progress.


And Nvidia didn't buy Groq.


When Peter gets tired of having a boss again, OpenAI will have zero OpenClaw.


Does your employer use Salesforce? Crappy software is practically the only software that anybody really pays for.


OpenClaw is underwhelming, and its founder is basically a hype machine.

