One of the things I’ve been hoping for every time a new EC2 instance family comes out is for them to unpin the memory:core ratio a bit. I don’t expect they have enough r# and c# users to completely balance things out, so what they’re really doing is selling people more CPUs to get the memory they need.
It would be nice if it were creeping up generation to generation. But if this keeps up I fear the opposite.
For many ephemeral workloads, sure, but that comes at the expense of generally worse and less consistent CPU performance.
There are plenty of workloads where I’d love to double the memory and halve the cores compared to what the memory-optimised R instances offer, or where I could further double the cores and halve the RAM from what the compute-optimised C instances can do.
“Serverless” options can provide that to an extent, but it’s no free lunch, especially in situations where performance is a large consideration. I’ve found some use cases where it was better to avoid AWS entirely and opt for dedicated options elsewhere. AWS is remarkably uncompetitive in some use cases.
KV stores also exist because, for many generations of tooling, it was faster to manage read-mostly data off-heap than on-heap, and that becomes more true the more processes you run doing jobs that touch the same data.
Yeah, those are pretty spendy. I know one comes with extra guaranteed bandwidth, which is kind of handy if you’re sharing a small number of cache nodes among a lot of servers. But we were doing okay running r6 instances for cache, though my coworker who knew the ritual for migrating them did eventually get a little boost out of switching us to r7s. The latency wasn’t great, and I don’t think faster network cards would have helped that. There was already plenty of incentive for us to do per-request promise caching to avoid pulling the same keys multiple times in a request, but that was necessary because the business model forced the architecture to tolerate nondeterminism. The cost per request was what eventually killed them (the economy dipped and customers ran to cheaper vendors), but I’ve never seen a company survive being stupid for as long as this place did.
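The per-request promise caching mentioned above can be sketched roughly like this (the names `makeRequestCache` and `fetchFromCache` are made up for illustration, not their actual code): the first caller for a key starts the fetch, and any concurrent callers within the same request share the in-flight Promise instead of issuing another round trip to the cache node.

```typescript
type Fetcher = (key: string) => Promise<string>;

// Build a per-request memoizing wrapper around a fetcher. The Map lives
// only as long as the request, so nothing here worries about invalidation.
function makeRequestCache(fetch: Fetcher): Fetcher {
  const inFlight = new Map<string, Promise<string>>();
  return (key: string) => {
    let p = inFlight.get(key);
    if (!p) {
      // First caller for this key: start the fetch and remember the Promise.
      p = fetch(key);
      inFlight.set(key, p);
    }
    // Later callers get the same Promise, so the network is hit once per key.
    return p;
  };
}

// Hypothetical stand-in for the real cache-node client call.
let hits = 0;
const fetchFromCache: Fetcher = async (key) => {
  hits++;
  return `value:${key}`;
};

const cached = makeRequestCache(fetchFromCache);
// Three concurrent lookups of the same key resolve from one round trip.
Promise.all([cached("k"), cached("k"), cached("k")]).then(() => {
  console.log(hits); // one fetch despite three callers
});
```

Note that this caches the Promise itself, not the resolved value, which is what dedupes callers that arrive while the fetch is still in flight; a value cache alone would still issue overlapping requests.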
You should just get used to it, because memory per core is going to keep falling until someone makes a physics breakthrough. We know how to print cores, and the core count is going to keep going up.
You can build eDRAM using logic processes. It's not usually done since ordinary DRAM ends up being cheaper, but if the usual DRAM processes are bottlenecked (and SRAM cell scaling is also hitting roadblocks of its own) that makes eDRAM a lot more viable, at least for specialty uses.