UX implications of PVF executor environment versioning

dmitry.sinyavin · April 11, 2023, 12:50pm

Okay, that was expected anyway. Not an urgent concern because, well, how many validators are running the Kagome implementation right now? But it may become a concern in the future for the aforementioned reasons. I’d keep an eye on it.

tomaka · April 11, 2023, 1:05pm

Validators are never going to run Kagome if they can possibly be slashed for doing it, and they can possibly be slashed for it because of the lack of determinism, and the lack of determinism isn’t fixed because validators aren’t running Kagome anyway.
It’s a snake biting its tail.

bkchr · April 11, 2023, 5:09pm

All the things you have written about wasmtime versioning or implementing a custom wasm vm do not really indicate that it is expected. All of this indicates that there should be one implementation, to prevent indeterminism between different implementations. Which would make everything we have worked on useless, as @tomaka already said.

We need to start defining an environment to run validation in that isn’t bound to special implementations, at least for the general vm. For stuff like stack depth metering we need to implement a way that is implementation agnostic and can be implemented by everybody. The same applies to all the other problems. For sure we will hit hidden bugs that will lead to slashes, but we can revert these slashes, learn and ensure that it never happens again.

dmitry.sinyavin · April 11, 2023, 7:57pm

That was expected because we don’t enforce a concrete VM implementation in our spec right now, so different implementations would have emerged sooner or later. That happened sooner. Okay, now we should deal with it somehow.

I still insist it’s not technically possible. If you see how to implement it, please tell me. I mean, implementation-agnostic native stack metering is not rocket science, but implementation-agnostic stack usage by different implementations is science fiction, as far as I can tell. And without deterministic usage, deterministic metering makes no sense.

eskimor · April 17, 2023, 8:31am

I just had another idea. First we should get a complete list of indeterminisms, then we might be able to extend the time dispute mechanism for other cases as well:

bkchr · April 17, 2023, 9:20am

Ty! In general, something that could be worked on or tried out as part of: https://polkadot.polkassembly.io/post/1683

burdges · April 17, 2023, 9:51am

Around this, we’ve discussed rerunning PVF builds whenever wasmtime changes:

github.com/paritytech/polkadot

Run preparation for all existing PVFs on the network before a release updating wasmtime

opened 09:48AM - 11 Apr 23 UTC

eskimor

We assume that once a PVF passed pre-checking that it will compile just fine also in the future. We should therefore make sure that whenever we are upgrading wasmtime or doing any changes to the preparation process that all existing/already registered PVFs on the network would still pass pre-checking with those changes.

- [ ] Provide tool for automatically scraping all PVFs from chain and compiling them.
- [ ] Include that tool in the release process

# Why is this important?

If previously working PVFs suddenly stop working, even for a single parachain/parathread, relay chain finality would stall until the issue is resolved: Better to discover issues before we hit production.

@ordian I believe you already have some tooling in place which could help with this?

@Sophia-Gold Once we have the tooling, we would need to coordinate with the release team and CI/CD to automate those checks. Safe path would be to run those compilations as part of each release.
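The release gate described in that issue could be sketched roughly as follows. This is only an illustration: `recheck_all`, `PrepFailure`, and the `prepare` callback are hypothetical names, and how the PVFs are scraped from chain is left out entirely. The sketch assumes each PVF is available as a `(para_id, code)` pair and that the candidate preparation step can be called as a function returning `Ok(())` or an error string.

```rust
/// Hypothetical record of one PVF that failed preparation
/// with the candidate toolchain (e.g. a new wasmtime version).
#[derive(Debug, PartialEq)]
pub struct PrepFailure {
    pub para_id: u32,
    pub reason: String,
}

/// Runs `prepare` over every already-registered PVF and collects the failures.
/// `prepare` stands in for the real preparation step, which is not modelled here.
pub fn recheck_all<F>(pvfs: &[(u32, Vec<u8>)], prepare: F) -> Vec<PrepFailure>
where
    F: Fn(&[u8]) -> Result<(), String>,
{
    pvfs.iter()
        .filter_map(|(para_id, code)| {
            prepare(code.as_slice()).err().map(|reason| PrepFailure {
                para_id: *para_id,
                reason,
            })
        })
        .collect()
}
```

A release would only proceed if the returned list is empty; any entry names a parachain whose PVF would stop preparing after the upgrade.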

It brings up one question:

Anytime a parachain updates its PVF, all validators build the PVF and vote upon the result. We do not attempt further optimization here because they’ll all build the PVF eventually anyway.

Can we similarly assume that “most” wasmtime updates require every validator to rebuild every PVF?

tomaka · April 17, 2023, 10:19am

Nobody has yet explained what was the problem with the existing stack metering, which I’ve linked twice already.

If we track the logical stack usage in a deterministic way, and assume that wasmtime (and other Wasm implementations) are implemented in a way that there exists a k such that native_stack <= k * logical_stack, then the problem is solved.

We would only have to determine this k, and I don’t see why we couldn’t set k to an absurdly high value like for example 1024.
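A minimal sketch of this proposal, assuming such a k exists. The names `LogicalStackMeter`, `Trap`, and `native_stack_reservation` are hypothetical, as are the units (logical stack counted in abstract slots, native stack in bytes). The meter traps deterministically on logical usage alone, and the native stack is simply sized to k times the logical limit so that it can never run out first.

```rust
/// Hypothetical trap raised when the logical stack limit is exceeded.
#[derive(Debug, PartialEq)]
pub enum Trap {
    StackOverflow,
}

/// Tracks logical stack usage, independent of how the code was compiled.
pub struct LogicalStackMeter {
    used: u64,
    limit: u64,
}

impl LogicalStackMeter {
    pub fn new(limit: u64) -> Self {
        Self { used: 0, limit }
    }

    /// Called on function entry with the statically computed logical frame cost.
    /// On overflow the meter state is left unchanged.
    pub fn enter(&mut self, frame_cost: u64) -> Result<(), Trap> {
        let next = self.used.checked_add(frame_cost).ok_or(Trap::StackOverflow)?;
        if next > self.limit {
            return Err(Trap::StackOverflow);
        }
        self.used = next;
        Ok(())
    }

    /// Called on function exit with the same frame cost.
    pub fn leave(&mut self, frame_cost: u64) {
        self.used = self.used.saturating_sub(frame_cost);
    }

    pub fn used(&self) -> u64 {
        self.used
    }
}

/// Native stack to reserve so that `native_stack <= k * logical_stack`
/// always fits. Returns `None` if the product overflows.
pub fn native_stack_reservation(logical_limit: u64, k: u64) -> Option<u64> {
    logical_limit.checked_mul(k)
}
```

With an assumed logical limit of 65536 and k = 1024, the reservation comes to 64 MiB of native stack. The scheme is only sound if the inequality really holds for every implementation in use.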

eskimor · April 17, 2023, 10:53am

I had a similar thought today. Is there such a k? I think there should be; if so, I agree that the problem seems solved.

eskimor · April 17, 2023, 10:57am

@dmitry.sinyavin can you get in touch with them to work out what that conformance testing should actually look like? I think we need to become more concrete. E.g. what exactly is the spec we are testing conformance to, and how?

m-cat · April 17, 2023, 4:19pm

Perhaps related:

github.com/paritytech/substrate

Precise stack depth metering for PVF

opened 10:30AM - 07 Jul 21 UTC

pepyakin

Currently for PVF execution we are using a rather naive algorithm for stack metering. See:

- https://github.com/paritytech/wasm-utils/blob/master/src/stack_height/mod.rs
- https://github.com/paritytech/substrate/blob/3cd75117765c4a63d40c00aa41e1bf12135c237b/client/executor/wasmtime/src/runtime.rs#L262-L304
- https://github.com/paritytech/substrate/blob/3cd75117765c4a63d40c00aa41e1bf12135c237b/client/executor/wasmtime/src/runtime.rs#L322-L331

However this simple model does not actually represent the underlying behavior of an optimizing compiler. Right now we try to overcome this by generously allocating the stack space for a relative small number of logical items.

What I've missed is that the regalloc will generate spill slots based on the number of active live ranges. See this discussion https://bytecodealliance.zulipchat.com/#narrow/stream/217126-wasmtime/topic/deterministic.20stack.20usage

Introducing the additional parameters into the mix is rather annoying and provides a very rough upper bound which I am not sure if useful.

As an alternative, Chris F. has suggested to look into having a virtually unlimited native stack and throw a stack overflow trap based on logical consumption. It's not trivial to implement though. There are certain considerations, i.e. how exactly provide such a stretchable native stack. If done naively we could introduce performance cliffs (such as what segmented stacks in Go had).

dmitry.sinyavin · April 18, 2023, 7:33am

Well, I kinda tried to explain that… There’s no ~~spoon~~ k. That’s what was assumed from the very start, that there’s some proportion here, and we can rely on it, and I was assigned to check that assumption. Unfortunately, it’s just wrong. You can produce a function that uses close to zero WASM value stack and creates a huge native stack frame, and vice-versa, you can compile a WASM bytecode utilizing a lot of value stack depth into native code that doesn’t create a stack frame at all. Moreover, that behavior (concrete stack sizes) deviates between different environment versions and differs dramatically between different environments. So even if k existed, it would be useless as we assume the possibility of using different environments and each one’s compilation logic is different.
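To make the objection concrete, here is a small sketch that computes the smallest k satisfying `native <= k * logical` over a set of per-function measurements. `FrameSample` and `min_k` are hypothetical names, and any numbers fed into it are invented for illustration, not measurements of wasmtime or any other engine. A single function with a tiny logical footprint and a large native frame pushes the required k arbitrarily high, and a function with zero logical usage but a non-empty native frame makes it undefined.

```rust
/// Hypothetical measurement of one compiled function.
#[derive(Clone, Copy)]
pub struct FrameSample {
    /// Logical (Wasm value stack) usage, in abstract slots.
    pub logical_slots: u64,
    /// Native stack frame size produced by some compiler, in bytes.
    pub native_bytes: u64,
}

/// Smallest integer k such that `native_bytes <= k * logical_slots` holds
/// for every sample, or `None` if no finite k can satisfy all of them.
pub fn min_k(samples: &[FrameSample]) -> Option<u64> {
    let mut k = 0u64;
    for s in samples {
        if s.native_bytes == 0 {
            // No native frame at all: any k works for this sample.
            continue;
        }
        if s.logical_slots == 0 {
            // Native frame with no logical usage: no finite k exists.
            return None;
        }
        // Ceiling division without overflow.
        let needed = s.native_bytes / s.logical_slots
            + u64::from(s.native_bytes % s.logical_slots != 0);
        k = k.max(needed);
    }
    Some(k)
}
```

For example, an invented sample with 1 logical slot and a 1 MiB native frame already requires k = 1048576, far above a guess like 1024, and the required value would change with every compiler version or environment whose frame layout differs.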
