AlexCoventry · 2026-02-25T04:13:00 1771992780

I'm not blaming you, but it's scary how many people are running these agents as if they were trusted entities.

hansvm · 2026-02-25T12:12:19 1772021539

https://news.ycombinator.com/item?id=47150476

notepad0x90 · 2026-02-25T05:57:24 1771999044

They're tools; you don't ascribe trust to them. You trust or distrust the user of the tool. It's like saying you trust your terminal emulator. And from my experience, they will ask for permission over a directory before running. I would love to know how people are having this happen to them. If you tell it it can make changes to a directory, you've given it every right to destroy anything in that directory. I haven't heard of people claiming it exceeded those boundaries and started messing with things it wasn't permitted to mess with to begin with.
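
A minimal sketch of what a directory boundary check could look like, assuming the tool resolves every requested path before touching it. The function name `is_within` and the example paths are hypothetical, not taken from any particular agent:

```python
from pathlib import Path


def is_within(allowed_root: str, candidate: str) -> bool:
    """Return True if candidate resolves to a location inside allowed_root."""
    root = Path(allowed_root).resolve()
    # Joining with an absolute candidate replaces the root entirely,
    # so absolute paths outside the root are caught by the check below.
    target = (root / candidate).resolve()
    return target == root or root in target.parents


if __name__ == "__main__":
    for path in ["src/main.py", "../other/file", "/etc/passwd"]:
        print(path, is_within("/tmp/project", path))
```

Everything under the allowed root is still fair game for deletion; the check only stops paths that escape it through `..` or an absolute path.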

fragmede · 2026-02-25T10:15:40 1772014540

That would be --dangerously-skip-permissions for Claude, and --dangerously-bypass-approvals-and-sandbox for Codex.

Aka yolo mode. And yes, people (me) are stupid enough to actually use that.

notepad0x90 · 2026-02-25T14:39:05 1772030345

It's a people problem then. Not blaming here, I'm just saying it isn't the tool being untrustworthy. I too get burned badly when I play with fire.

AnimalMuppet · 2026-02-25T13:49:33 1772027373

OK, but we learned decades ago about putting safety guards on dangerous machinery, as part of the machinery. Sure, you can run LLMs in a sandbox, but that's a separate step, rather than part of the machinery.

What we need is for the LLM to do the sandboxing... if we could trust it to always do it.

notepad0x90 · 2026-02-25T14:42:58 1772030578

Again, the trust is for the human/self. It's auto-complete; it hallucinates and commits errors, and that's the nature of the tool. It's for the tool's users to put appropriate safeguards around it. Fire burns you, but if you contain it, it can do amazing things. It isn't the fire being untrustworthy for failing to contain itself and start burning your cloth when you expose your arm to it. You're expecting a dumb tool to be smart and know better. I suspect that is because of the "AI" marketing term and the whole supposition that it is some sort of pseudo-intelligence. It's just auto-complete. When you have it run code in an environment, it could auto-complete 'rm -rf /'.
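
A minimal sketch of one such safeguard, assuming the agent's shell commands pass through a wrapper before execution. The allowlist contents and the name `is_allowed` are hypothetical, and this is a crude filter rather than a substitute for a real sandbox:

```python
import shlex

# Hypothetical allowlist; a real deployment would pick its own set.
ALLOWED_COMMANDS = {"ls", "cat", "grep", "git"}

# Characters that let one command line chain or redirect into another.
SHELL_METACHARACTERS = ";|&`$><"


def is_allowed(command_line: str) -> bool:
    """Return True only for a single allowlisted command with no shell tricks."""
    if any(ch in command_line for ch in SHELL_METACHARACTERS):
        return False
    try:
        tokens = shlex.split(command_line)
    except ValueError:
        # Unbalanced quotes and similar parse errors are rejected outright.
        return False
    return bool(tokens) and tokens[0] in ALLOWED_COMMANDS


if __name__ == "__main__":
    for cmd in ["ls -la", "rm -rf /", "ls; rm -rf /"]:
        print(repr(cmd), is_allowed(cmd))
```

Even an allowlisted command can be destructive with the wrong arguments, so a filter like this only narrows the blast radius.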

AnimalMuppet · 2026-02-25T16:39:40 1772037580

> Fire burns you, but if you contain it, it can do amazing things. It isn't the fire being untrustworthy for failing to contain itself and start burning your cloth when you expose your arm to it.

True. But I expect my furnace to be trustworthy to not burn my house down. I expect my circular saw to come with a blade guard. I expect my chainsaw to come with an auto-stop.

But you are correct that in the AI area, that's not the kind of tool we have today. We have dangerous tools, non-OSHA-approved tools, tools that will hurt you if you aren't very careful with them. There's been all this development in making AI more powerful, and not nearly enough in ergonomics (for want of a better word).

We need tools that actually work the way the users expect. We don't have that. (And, as you say, marketing is a big part of the problem. People might expect closer to what the tool actually does, if marketing didn't try so hard to present it as something it is not.)

AlexCoventry · 2026-02-25T04:11:06 1771992666

Not sure whether you're being sarcastic, either.

https://en.wikipedia.org/wiki/Business_Plot

AlexCoventry · 2026-02-24T08:24:47 1771921487

OpenAI is implying that code may no longer be human readable in some circumstances.

> The resulting code does not always match human stylistic preferences, and that’s okay. As long as the output is correct, maintainable, and legible *to future agent runs*, it meets the bar.

https://openai.com/index/harness-engineering/

AlexCoventry · 2026-02-24T06:39:02 1771915142

> If you look at the code, you’ll notice it has a strong “translated from C++” vibe. That’s because it is translated from C++. The top priority for this first pass is compatibility with our C++ pipeline. The Rust code intentionally mimics things like the C++ register allocation patterns so that the two compilers produce identical bytecode. Correctness is a close second. We know the result isn’t idiomatic Rust, and there’s a lot that can be simplified once we’re comfortable retiring the C++ pipeline.

Does this still get you most of the memory-safety benefits of using Rust vs C++?

joelthelion · 2026-02-24T07:37:30 1771918650

I think this largely depends on how much unsafe Rust they produced.

AlexCoventry · 2026-02-16T07:59:16 1771228756

He didn't even have to be the one buying them. Lots of people benefit from a tool like OpenClaw getting popular.

AlexCoventry · 2026-02-16T01:36:05 1771205765

Are there any with a credible approach to security, privacy and prompt injections?

rlt · 2026-02-16T04:35:33 1771216533

Does any credible approach to prompt injection even exist?

joquarky · 2026-02-16T06:10:02 1771222202

Anyone who figures out a reliable solution would probably never have to work again.

AlexCoventry · 2026-02-16T07:53:07 1771228387

Not that I'm aware of, but I probably won't be interested in these kinds of assistants until there are.

AlexCoventry · 2026-02-16T01:34:32 1771205672

He's also a great booster of Codex. Says he greatly prefers it to Claude. So his role might turn out to be evangelism.

ass22 · 2026-02-16T01:37:50 1771205870

Yup, he's highly delusional if he actually thinks Sam cares about him and the project. It's all about optics.

DANmode · 2026-02-16T01:55:21 1771206921

Who purported that Sam cares about him?

Why would he care if Sam cares about him?

ass22 · 2026-02-16T17:51:19 1771264279

Someone clearly hasn't watched the podcast. Do your research before posting.

DANmode · 2026-02-16T20:37:28 1771274248

These are comments on the posted article.

If you want to bring other sources into the conversation, you could link, or at least reference them by name upfront, right?

whattheheckheck · 2026-02-16T02:30:22 1771209022

Listen to him on a podcast? He said he liked Zuckerberg being more personal with him and Sam was colder

catoc · 2026-02-17T08:03:08 1771315388

He’s not “highly delusional”

He literally said he doesn’t give a fuck about money and “I will get the fuck out of there [openai] if I don’t like it” [source friedman interview]

This guy is very smart, and very persistent. I really don’t get all the negativity about the acquisition and especially not about him.

AlexCoventry · 2026-02-15T00:39:11 1771115951

There are actually books which recommend that organizations track employee tokens burned as a proxy for AI adoption. Surprised me a bit.

reactordev · 2026-02-15T01:30:05 1771119005

it's the only KPI available.

AlexCoventry · 2026-02-14T18:31:09 1771093869

Humans don't have much capacity for systematic tree search. It's sort of amazing that humans can do as well as they can, given that limitation.

AlexCoventry · 2026-02-10T20:15:51 1770754551

FWIW, you'd probably be able to buy a lot of goods and services for $7/day, if robots were doing literally all the work.

sp527 · 2026-02-10T21:03:58 1770757438

Agreed. The quality of life bar will be higher for sure. But it will still technically be a "subsistence" lifestyle, with no prospect of improvement. Perhaps that will suffice for most people? We're going to find out.

pdonis · 2026-02-11T00:40:47 1770770447

> if robots were doing literally all the work

Let me know when ChatGPT can do your laundry.

AlexCoventry · 2026-02-13T23:40:50 1771026050

Give it five years.