More

sealeck · 2026-06-08T23:00:37 1780959637

Have we reached the limits of scaling? Sadly it appears that larger model still equals better model

mikestorrent · 2026-06-08T23:32:20 1780961540

Well, let's not forget that text models are not the only models! Video models are much slower and need comparatively more resources, and all they can do even at that size is generate videos a few seconds long. Clearly a ton more work is going to go into those, and demand for them will probably increase as more creative tools get authored using them as a central part of the workflow. Low-res local rendering for preview might be a thing, but the lion's share of the work for high-res, near-realtime rendering is going to be done on huge clusters for a long time yet.

niek_pas · 2026-06-09T10:18:31 1781000311

This is definitely a good point. I imagine the max capacity for video models is significantly lower than for text models (there just aren't as many professionals in video as there are people who write text or code) but I could be wrong.

pixelready · 2026-06-08T23:12:14 1780960334

I think there’s still an open question around are the ultra-large next-gen models worth it? For those of us without early access to Mythos, it’s hard to verify whether it’s been held back from the public due to actually being “too dangerously powerful to release yet” as implied or because the gains aren’t outpacing the costs.

mindwok · 2026-06-08T23:36:58 1780961818

I think GPT 4.5 showed that there is indeed a practical limit we're close too. That was supposedly a high-trillions of parameter model that was deprecated almost immediately because it was slow, insanely expensive, and had questionable benefits over the smaller models. Though apparently the new Mythos and whatever GPT Spud is (if it wasn't 5.5) are back up in the high trillions.

XenophileJKO · 2026-06-08T23:49:59 1780962599

Actually having used it a bit, I'm quite excited to see a modern model of similar size.

I think what people didn't realize was, just because the GPT-4.5 model didn't get better on the benchmarks, didn't mean the model wasn't different than the earlier models. It was being compared to thinking models that were being developed at the same time.

The GPT 4.5 model still has some of the most "human" like abilities in communication even though it isn't particularly good a problem solving. It hadn't under gone the same type of reinforcement training.

I still use GPT 4.5 sometimes, in creative exercises it can be surprisingly effective. The model is still available.

adgjlsfhk1 · 2026-06-09T02:17:50 1780971470

yes and no. We've reached the point where larger models are higher quality, but they're also too expensive and slow to be used broadly. The giant models, however are still useful for training smaller models that are actually deployable.

stogot · 2026-06-08T23:06:13 1780959973

It’s still diminishing returns yes? It isn’t Moore’s Law

sealeck · 2026-05-30T16:37:23 1780159043

I think the issue with these kind of stances is that they are basically status quo bias; why don't you object to the computer itself, and thus refuse to write programs? After all: they were invented by the UK military in the pursuit of military goals (and much of their subsequent development was funded by the US military - see https://types.pl/@graydon/110648447694201698 - and the fact that ARPAnet, GPS, etc were all military creations). Computer systems are mostly used by large corporations and the military to achieve their goals more effectively.

Usually the objection is that "oh well, the computer can be used for many great things", which isn't particularly satisfying because, um, we can use AI for "good" (better?) things as well (e.g. trying to find novel cures, unlocking the mysteries of protein folding, etc etc).

Then the objection becomes something like "well the computer is here and we have to live with it", which is also now true of AI. Do I like the "it's inevitable" argument; no, but it's clearly very true that we do have the transformer, that won't go away - where we DO have control (or should seek to change) is the organisational structures that we as a society decide to create, and how we safeguard the dignity of the individual in changing times.

FloorEgg · 2026-05-30T17:01:18 1780160478

Being able to discern what is and isn't in our control helps tremendously in doing what is right and constructive.

The fact that some people opt out of engaging with AI, I think is healthy for society as a whole. If that's within their control and they exercise their control to do what they think is right, then I commend them.

That said, I do think there is a greater natural force at play, something involving entropy and increasing complexity and energy profit maximization. It seems to cut through all levels of abstraction from organic chemistry to civilizations and probably beyond. I assume this is outside of humanity's control, and therefore outside of any individuals control.

So what is inside our control? Our own perceptions and actions.

My perception is that the advance of computation and by extension proliferation of probabilistic programs (AI) is inevitable. It's on a continuum that is a force of nature.

What I might have some control over is choosing to harness that potential to increase future prosperity for more people and the greater environment, and to avoid contributing to outcomes that harm people and the environment.

Lots of bad things are happening and will happen that are outside my control.

I do genuinely believe that the capabilities are inherently neutral. Civilization can choose to harness them in a variety of ways, for a variety of purposes.

If the majority of people choose options that are game theory win-win, then the future will be better... If the majority of people choose win-lose, then the future will probably be worse.

The risk isn't AI, it's how we choose to use it.

gitaarik · 2026-05-31T07:54:54 1780214094

Yeah so therefore I think a positive attitude is all the more needed, where you see the potentials, see solutions instead of problems. But I feel most anti-AI people are just negative people seeing only problems and don't have any solutions to offer.

sealeck · 2026-05-30T16:17:07 1780157827

Allegedly OpenAI's contracting model is much more vicious than Anthropic's; at work (admittedly a little IP-protective) we have unlimited Claude, but no Codex subscription because OpenAI won't give us sufficient guarantees around data retention.

We are also concerned that it may not be possible to bind OpenAI using contract terms and/or the US legal system.

sealeck · 2026-05-30T13:02:51 1780146171

> The technologists who create it believe they should control it

I think there's an interesting phenomenon where it is _not_ the people who control it, but instead a kind of international finance man cum-captain of industry (perhaps best embodied by Sam Altman) who does not create the technology and yet has ended up wielding the levers.

SilverElfin · 2026-05-30T16:22:54 1780158174

> a kind of international finance man cum-captain of industry (perhaps best embodied by Sam Altman)

What the hell is a “cum-captain”? Search isn’t helping.

huhkerrf · 2026-05-30T16:31:39 1780158699

Probably this: https://en.wiktionary.org/wiki/cum

I.e. finance man as well as captain of industry

whimsicalism · 2026-05-31T15:03:06 1780239786

the correct usage is dashes on both sides of the word, the usage above was extra confusing because it was incorrect

sealeck · 2026-05-30T17:37:57 1780162677

As in cum-"captain of industry"

froh · 2026-05-30T21:30:34 1780176634

non native speaker here - I don't get it.

HappMacDonald · 2026-05-30T21:50:42 1780177842

It's an old English word (predating the sexual connotation):

cum - Used in indicating a thing or person which has two or more roles, functions, or natures, or which has changed from one to another.

Basically nobody uses that language construct anymore until you run headlong into it in a Hackernews comment or something

whimsicalism · 2026-05-31T15:02:05 1780239725

I use it, but you're supposed to hyphenate on both sides so this usage was incorrect.

froh · 2026-05-31T04:58:29 1780203509

thank you! this was most helpful to lead me into asking the right question.

https://share.google/aimode/dDekJEZzfKaE6FCvH

ARandomerDude · 2026-05-31T00:46:08 1780188368

“cum” (rhymes with “broom”, rather than “dumb”) is Latin for “with”.

curio_Pol_curio · 2026-05-31T01:23:30 1780190610

that's when it appears cum "laude" (eg)

in commonwealth (seniors in everyday UK, HR and pedants otherwise) usage it rhymes with dumb, like you'd expect

https://youtu.be/RzESsmv5FhM

Radcliff-cum-Chackmore

anigbrowl · 2026-05-31T03:18:03 1780197483

No it isn't. It's Latin.

froh · 2026-05-31T04:59:45 1780203585

only the ethymology is latin. the use combining role names is old English, indeed.

anigbrowl · 2026-05-31T03:17:08 1780197428

'cum' is latin for 'with', and it is commonly hyphenated when inserted in between other words.

It's also a slang word for semen, but that's not relevant here.

LtWorf · 2026-05-30T23:23:05 1780183385

I suspect they mean "with" but in latin. But I'm not entirely sure.

hunterpayne · 2026-05-31T02:09:33 1780193373

That's correct

sealeck · 2026-04-12T00:10:35 1775952635

Why does the false positive rate matter if you have a verifiable oracle? You can just disregard anything that fails the oracle

lordofgibbons · 2026-04-12T00:38:03 1775954283

What's the verifiable oracle in this scenario?

throwa356262 · 2026-04-12T08:16:25 1775981785

Write the exploit then run it?

sealeck · 2026-04-07T20:21:40 1775593300

Dystopian present

sealeck · 2026-03-29T01:21:13 1774747273

Surely the other way around? Phone QA process >>> disposable vape QA process...

joecool1029 · 2026-03-29T02:39:07 1774751947

It's not even just a QA thing, consider the use case: A sub-ohm vape head is basically almost shorting what is often a unprotected lithium ion cell (18650 or whatnot). Phones meanwhile are full of temperature sensors, battery pack in the phone has some kind of firmware/monitoring, board on the phone has a charge controller.

There are plenty of good cell manufacturers that won't have problems in this current dumping situation (and will have certain passive protections like a CID to cut the current if it gets too hot). Problem is people like cheap and there are sketchy knockoff cells without those protections and shoddy manufacturing quality.

If there was anything recently that forced the change it was probably the CT scans of the Haribo battery packs showing the cathode/anode overlap. This sort of thing should spook airlines.

Do we still have UL? Do they test battery packs? Why not make it a requirement to only fly with ones that pass lab testing like UL?

justinclift · 2026-03-29T13:26:29 1774790789

For anyone else wonder about this:

> If there was anything recently that forced the change it was probably the CT scans of the Haribo battery packs showing the cathode/anode overlap.

It seems to be this, and yeah, it seems actually bad:

https://www.theverge.com/news/818906/haribo-gummy-bear-power...

idiotsecant · 2026-03-29T15:10:30 1774797030

Monkey paw curls. Good news! All cheap Chinese vapes now put a UL logo on them, making it impossible to find an actually certified vape.

whynotmaybe · 2026-03-29T15:53:35 1774799615

No need, they already have their own CE https://www.hqts.com/differences-between-ce-conformite-europ...

ComputerGuru · 2026-03-29T01:36:30 1774748190

You’re both saying the same thing.

sealeck · 2026-03-29T01:19:27 1774747167

Yes and making a horse drawn cart drive itself was thought to be impossible so why don't we have faster than light travel yet...

Finbel · 2026-03-29T06:43:48 1774766628

Yes but "the search space is too large" is something that has been said about innumerable AI-problems that were then solved. So it's not unreasonable that one doubts the merit of the statement when it's said for the umpteenth time.

hodgehog11 · 2026-03-29T07:27:19 1774769239

I should have been more specific then. The problem isn't that the search space is too large to explore. The problem is that the search space is so large that the training procedure actively prefers to restrict the search space to maximise short term rewards, regardless of hyperparameter selection. There is a tradeoff here that could be ignored in the case of chess, but not for general math problems.

This is far from unsolvable. It just means that the "apply RL like AlphaGo" attitude is laughably naive. We need at least one more trick.

vatsachak · 2026-03-30T00:12:54 1774829574

The other trick could be bootstrapping through mathlib.

As you said brute forcing the search space as the starting procedure would take way too long for the AI to build intuition.

But if we could give it a million or so lemmas of human math, that would be a great starting point.

sealeck · 2026-03-23T18:57:24 1774292244

> I found an (as of yet not-root-caused) error in sqlite (no crash or coredump, just returns the wrong data, and only when using sqlite in ram-only-mode).

You should report this to the SQLite developers - they are very smart and very interested in fixing SQLite correctness bugs!

sealeck · 2026-02-28T01:02:36 1772240556

> Stallman has always been right. It's mind boggling just how right he was about everything.

Mind boggling right about not allowing GCC to be used as a library, his comments on Jeffrey Esptein, a refusal to in any way compromise (e.g. the GNU/Linux meme), etc...

Oh and a recognition that free software, while nice, does not in any way solve the underlying issues he claims it does. Similarly to how letting everyone walk around their local water treatment facility and perform chemical tests doesn't really work and instead the state regulates and hires experts to monitor the water supply...

basilikum · 2026-02-28T02:47:45 1772246865

> his comments on Jeffrey Esptein

Do you disagree with his description of Epstein as a serial rapist? Do you disagree with Stallman's position that Epstein should be described according to the specific crimes he committed: rape instead of using much more vague terms that also encompass much less severe crimes which Epstein himself used to downplay and obscure the actual crimes he committed?

If so, why?

matheusmoreira · 2026-02-28T01:45:00 1772243100

> not allowing GCC to be used as a library

Nothing wrong with that move from a strategic point of view. The objective was to leverage GCC and make others play ball. People who wanted GCC should have been forced to do things the free software way.

Only problem with this is it turned out GCC didn't provide enough leverage. Replacing GCC wasn't difficult enough. People implemented LLVM instead and the rest is history.

Compare that to Linux which literally leaves companies behind in the dust when they refuse to merge. No kernel ABI stability: if out-of-tree stuff gets broken it's not their problem. Companies have a choice: play ball or pay the maintenance costs required to keep up with the biggest free software project ever. That's how it should be.

> his comments on Jeffrey Esptein

By "everything" I of course meant his ideas on computer freedom which is the context of this thread. I don't know or care about his opinions on Epstein.

> a refusal to in any way compromise

As he should. If anything he's not extreme enough. Compromise is the root of many evils.

> a recognition that free software, while nice, does not in any way solve the underlying issues he claims it does

Elaborate.