> rather than a virtual CPU architecture The wasm security model is a lot more c...

hajile · on Nov 3, 2023

You could design a CPU to accept WASM as an input...

https://www.destroyallsoftware.com/talks/the-birth-and-death...

slaymaker1907 · on Nov 3, 2023

I think you could really speed up WASM a lot if CPUs supported the WASM sandbox better. For example, it would be nice to have modern segmented memory support where you could set an offset and max address such that all other pointer operations worked off that mini-TLB, generating an interrupt if the bounds are bypassed.

torginus · on Nov 4, 2023

This kinda exists in hardware in mid-size microcontrollers, and is called the Memory Protection Unity:

Here's an example:

https://developer.arm.com/documentation/ddi0439/b/Memory-Pro...

More complex designs tend to go for a full-blown MMU, and I wonder if the presence of such a feature would be warranted when you could go for either full-blown process isolation (which afaik is not that expensive on modern CPUs), or just go with static and dynamic checks (basically and if statement that checks if the address is valid, which can be optimized away like 90% of the time, when iterating through known arrays etc.)

slaymaker1907 · on Nov 6, 2023

The part that would be nice to bypass is TLB switching and cache invalidation. WASM doesn't need full page translation because it largely assumes the containing process already does that and because it doesn't support allocating non-contiguous memory without using multiple linear memories. Even with multiple linear memories, it still doesn't require (or event want) page translation because these memories each have their own address space.

The issue with if statements are that the stats/bits branch predictors use are a finite resource. You really need these checks to be inlined as well because otherwise you'll thrash the instruction cache. If it just had some special register with an interrupt, the CPU could just always assume the index is valid for the purposes of speculative execution and branch prediction.

kaba0 · on Nov 3, 2023

This is sorta what plenty runtimes do, but on a larger scale with pages with incorrect permissions. I don’t think that interrupts going through the OS would be feasibly fast for scales at typical array sizes, probably a properly branch-predicted conditional will be faster.

saagarjha · on Nov 3, 2023

WASM has a very simple security model compared to almost any modern CPU. No processor has a linear memory map with no segmentation or permissions these days.

marcosdumay · on Nov 3, 2023

Yes, it's simple. And yet it doesn't allow things like high-performance GC.

That's why they are making it more complex.

saagarjha · on Nov 3, 2023

I mean it does, it just means that you need to bundle your own high-performance GC. And people don't really want to do that.