More

cyanf · 2025-12-21T16:10:32 1766333432

Why would you want to ssh into a machine that's not yours? That's a violation of the Computer Frauds and Abuse Act, up to 10 years in prison!

montroser · 2025-12-21T17:01:42 1766336502

I think you're joking, but to clarify -- not personally yours. A misbehaving worker box, an app server in the staging environment, etc. A resource owned by the organization for which you work, where it would not be appropriate for you to customize it to your own liking

otterley · 2025-12-21T16:13:50 1766333630

When you have permission to do so, it isn’t.

cyanf · 2025-11-20T21:40:54 1763674854

Tragedy of the aggregate.

cyanf · 2025-10-29T21:57:59 1761775079

There are existing solutions for queues in Postgres, notably pgmq.

cyanf · 2025-10-02T13:50:40 1759413040

Despite sentiments around Mojo being negative on HN due to the stack not being OSS, this is the ultimate goal of Modular.

https://signalsandthreads.com/why-ml-needs-a-new-programming...

pohl · 2025-10-02T14:07:43 1759414063

I listened to that episode, by chance, last week. It was well worth the time to listen.

cyanf · 2025-10-01T08:14:37 1759306477

The blog's title can be misleading here, "we" in this context refers to the Cognition team. I don't work at Cognition, just thought this was interesting.

cyanf · 2025-09-17T22:17:34 1758147454

> On August 29, a routine load balancing change unintentionally increased the number of short-context requests routed to the 1M context servers. At the worst impacted hour on August 31, 16% of Sonnet 4 requests were affected.

Interesting, this implies that the 1M context servers performs worst at low context. Perhaps this is due to some KV cache compression, eviction or sparse attention scheme being applied on these 1M context servers?

kiratp · 2025-09-17T22:45:14 1758149114

This is due to RoPE scaling.

> All the notable open-source frameworks implement static YaRN, which means the scaling factor remains constant regardless of input length, potentially impacting performance on shorter texts. We advise adding the rope_scaling configuration only when processing long contexts is required. It is also recommended to modify the factor as needed. For example, if the typical context length for your application is 524,288 tokens, it would be better to set factor as 2.0.

https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking

FossQuestion · 2025-09-18T16:14:28 1758212068

The key issue is that their post-mortem never explained what went wrong on two out of three issues.

All I know is that my requests can now travel along three completely different code paths, each on its own stack and tuned differently. Those optimizations can flip overnight, independent of any model-version bump—so whatever worked yesterday may already be broken today.

I really don't get the praise that they are getting for this postmortem, it only made me more annoyed.

cyanf · 2025-09-03T18:26:11 1756923971

Snappiness is the primary reason for using Zed.

cyanf · 2025-09-03T18:24:33 1756923873

The other examples you listed are valid, but A.I tab auto complete is a model & inference issue unrelated to the editor.

rvnx · 2025-09-03T19:07:00 1756926420

It is a feature that they control. Whether it comes from the model, a bad prompt, a bad provider or a bug in their implementation is their responsibility (especially considering you have to pay per-request AI features).

cyanf · 2025-09-03T19:44:28 1756928668

That’s true if we’re evaluating Zed as a product, but the GP is discussing Zed U.I perf specifically.

fkyoureadthedoc · 2025-09-03T19:31:27 1756927887

Idk if 'linux + gpu = problem' is surprising or very relevant either.

cyanf · 2025-08-19T19:40:10 1755632410

I have the same set of requirements you’re describing and Obsidian is perfect.

You can disable the graph feature and never link any notes.

cyanf · 2025-07-11T16:31:58 1752251518

This is both the largest oss model release thus far, and the largest Muon training run.