feat: Linker #87

george-cosma · 2024-09-19T11:26:24Z

Pull Request Overview

This pull request adds a "Lookup table" linker. See #83 first.

Basically, after each time a new module is added we attempt to create a "lookup table". This lookup table translate the imported functions from each module to the exported local function in the already-existing modules.

Resumability is kept by passing all module-specific details (like the wasm binary, the readers, function types, store, etc.) to the interpreter_loop::run function. These details have been compiled into a structure called ExecutionInfo.

Also of note, I split the "FunctionInstance" structure into two variants - local and imported, for clarity sake. This comes with the needed modification to the validation process and store-creation process.

Testing Strategy

This pull request was tested by writing a new test, and tested against the existing tests.

Formatting

Github Issue

This pull request closes #83

src/execution/interpreter_loop.rs

src/execution/value_stack.rs

george-cosma · 2024-09-24T10:03:26Z

Blocked on #88

… for linker Signed-off-by: George Cosma <[email protected]>

codecov · 2024-09-26T13:32:51Z

Codecov Report

Attention: Patch coverage is 90.76087% with 34 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/execution/interpreter_loop.rs	91.20%	11 Missing ⚠️
src/execution/mod.rs	90.75%	3 Missing and 8 partials ⚠️
src/validation/mod.rs	81.57%	7 Missing ⚠️
src/execution/lut.rs	94.00%	2 Missing and 1 partial ⚠️
src/core/error.rs	0.00%	2 Missing ⚠️

Files with missing lines	Coverage Δ
src/execution/execution_info.rs	`100.00% <100.00%> (ø)`
src/execution/store.rs	`76.66% <100.00%> (+23.33%)`	⬆️
src/execution/value_stack.rs	`78.78% <100.00%> (+0.66%)`	⬆️
src/validation/code.rs	`71.42% <100.00%> (-1.07%)`	⬇️
src/core/error.rs	`30.88% <0.00%> (-5.49%)`	⬇️
src/execution/lut.rs	`94.00% <94.00%> (ø)`
src/validation/mod.rs	`76.51% <81.57%> (+0.23%)`	⬆️
src/execution/interpreter_loop.rs	`96.58% <91.20%> (-0.47%)`	⬇️
src/execution/mod.rs	`93.60% <90.75%> (+2.37%)`	⬆️

george-cosma · 2024-09-26T13:55:18Z

src/execution/lut.rs

+            // TODO: what do we want to do if there is a missing import/export pair? Currently we fail the entire
+            // operation. Should it be a RuntimeError if said missing pair is called?


As per internal discussion, "better safe than sorry"

src/execution/lut.rs

george-cosma · 2024-09-26T13:58:10Z

src/execution/mod.rs


+        // TODO: how do we handle the start function, if we don't have a LUT yet?


If a module has a start function, this function will error out if we attempt to run "start" if there are any unmet imports, even if they are not used in "start" itself.

Running an example where we have an import and it's not met in wasm-interp leads to the same error: invalid import. I think this is the safe way to go about it.
(wasm-interp actually errors out even if we don't run any of the code:

(module (import "console" "log" (func $log (param i32))) (func $dummy i32.const 0 drop ) ;; (start $dummy) )

)

src/execution/mod.rs

Signed-off-by: George Cosma <[email protected]>

nerodesu017 · 2024-09-30T12:52:10Z

src/execution/mod.rs

@@ -288,20 +357,30 @@ where
            let (module_idx, func_idx) =
                self.get_indicies(&function_ref.module_name, &function_ref.function_name)?;

-            if module_idx != function_ref.module_index || func_idx != function_ref.function_index {
-                // TODO: should we return a different error here?
+            if module_idx != function_ref.module_index {


Maybe just one if statement here with both conditions combined?

The reason I wanted to split them off was to be able to discern which reference is wrong - the module or the function. This was done to conform to the two errors: ModuleNotFound and FunctionNotFound. However, it would also be valid to say there should be a 3rd error type to indicate a stale or incorrect reference

I see, that makes sense. I also agree on the 3rd error type for an incorrect reference.

cemonem · 2024-12-05T14:34:36Z

src/execution/lut.rs

+                }
+            })
+    }
+}


This scheme can't handle cases where imports cascade. Let A, B, C be modules where A imports func B.f and rexports A.g, and C import A.g as C.h. When h is called, we need to recursively follow until a local func is hit (which we might not). we might need to check the func_idx is not imported itself here, and if it is, follow the other module without a recursive func call (stackless interpreter rule). We should also throw error if the (module_idx,func_idx) ends up the same as local_module_idx, local_func_idx, since that indicates a cyclic reference. But we can also make do with this too and leave it as an issue.

cemonem · 2024-12-05T14:47:29Z

tests/imports.rs

+
+    let wasm_bytes = wat::parse_str(SIMPLE_IMPORT_ADDON).unwrap();
+    let validation_info = validate(&wasm_bytes).expect("validation failed");
+    instance.add_module("env", &validation_info);


This a bit nitpicky, but I am more in favor of an interface where if A depends on B's store values for instantiation and B depends on nothing, B is instantiated with a RuntimeInstance::new_named("B") and then we instantiate a with RuntimeInstance::new_named_with("A", instance_of_B/imports??) sort of thing, since instantiation of A might depend on already instantiated const values etc. in B (https://webassembly.github.io/spec/core/exec/modules.html#external-typing) here I think the spec necessitates that the imports are instantiated within their module store. For globals at least this is definitely required, since a global.get of an imported const global can be used in constant exprs during instantiation, and that itself needs to have been instantiated. I think in js module instantiation is also this way, but i am not sure. But this can stay too.

george-cosma self-assigned this Sep 19, 2024

github-actions bot added execution validation tests priority-high priority-medium labels Sep 19, 2024

george-cosma added this to the Stabilize the Architecture milestone Sep 19, 2024

george-cosma marked this pull request as draft September 19, 2024 11:26

george-cosma mentioned this pull request Sep 23, 2024

Linker API Changes #88

Merged

5 tasks

nerodesu017 reviewed Sep 24, 2024

View reviewed changes

src/execution/interpreter_loop.rs Outdated Show resolved Hide resolved

nerodesu017 reviewed Sep 24, 2024

View reviewed changes

src/execution/value_stack.rs Outdated Show resolved Hide resolved

george-cosma mentioned this pull request Sep 24, 2024

Future bug: Analyze function index offset for imported functions #27

Open

chore: split FuncInst into local and imported variant. In preparation…

ee05bf3

… for linker Signed-off-by: George Cosma <[email protected]>

george-cosma force-pushed the dev/george-linker branch 3 times, most recently from 737c1a5 to 9e29849 Compare September 26, 2024 13:24

george-cosma marked this pull request as ready for review September 26, 2024 13:27

george-cosma commented Sep 26, 2024

View reviewed changes

feat: implement simple linker

bf9202b

Signed-off-by: George Cosma <[email protected]>

george-cosma force-pushed the dev/george-linker branch from 9e29849 to bf9202b Compare September 26, 2024 15:02

github-actions bot added the core label Sep 26, 2024

george-cosma requested review from valexandru, wucke13, nerodesu017 and florianhartung September 27, 2024 11:40

nerodesu017 reviewed Sep 30, 2024

View reviewed changes

cemonem requested changes Dec 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Linker #87

feat: Linker #87

george-cosma commented Sep 19, 2024 •

edited

Loading

george-cosma commented Sep 24, 2024 •

edited by valexandru

Loading

codecov bot commented Sep 26, 2024 •

edited

Loading

george-cosma Sep 26, 2024

george-cosma Sep 26, 2024

nerodesu017 Sep 30, 2024

nerodesu017 Sep 30, 2024

george-cosma Oct 1, 2024

nerodesu017 Oct 2, 2024

cemonem Dec 5, 2024

cemonem Dec 5, 2024

		// TODO: what do we want to do if there is a missing import/export pair? Currently we fail the entire
		// operation. Should it be a RuntimeError if said missing pair is called?


		// TODO: how do we handle the start function, if we don't have a LUT yet?

feat: Linker #87

Are you sure you want to change the base?

feat: Linker #87

Conversation

george-cosma commented Sep 19, 2024 • edited Loading

Pull Request Overview

Testing Strategy

Formatting

Github Issue

george-cosma commented Sep 24, 2024 • edited by valexandru Loading

codecov bot commented Sep 26, 2024 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

george-cosma commented Sep 19, 2024 •

edited

Loading

george-cosma commented Sep 24, 2024 •

edited by valexandru

Loading

codecov bot commented Sep 26, 2024 •

edited

Loading