[Example] ggml: add Qwen2-VL example #166

Status: Open · wants to merge 3 commits into base: master
Conversation

dm4 (Member) commented Dec 24, 2024

juntao (Member) commented Dec 24, 2024

Hello, I am a code review agent on flows.network. Here are my reviews of changed source code files in this PR.


.github/workflows/llama.yml

Potential issues

  1. Repeated Model Downloads: Multiple jobs download the same model files (e.g., llama-2-7b-chat.Q5_K_M.gguf) from Hugging Face, leading to redundant network requests and potential storage bloat. Consider caching these models or using a shared cache step.

  2. Hardcoded Environment Variables: The NGL environment variable is set in the Set environment variable step but also directly used in some job steps without ensuring it's always defined (e.g., --env n_gpu_layers="$NGL"). This can lead to undefined behavior if the variable isn't properly set for all jobs.

  3. Inconsistent Input Formats: Different jobs use varying input formats and prompts, which may confuse or cause errors in models expecting a specific format (e.g., '<start_of_turn>user Where is the capital of Japan? <end_of_turn><start_of_turn>model' vs. $'[INST] <<SYS>>\nYou are a helpful...). Standardizing input formats would improve reliability and maintainability.
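
Points 1 and 2 above could be addressed with a shared cache step and a shell fallback. A minimal sketch, assuming GitHub-hosted runners (the cache path, cache key, and invocation are illustrative, not the workflow's actual values):

```yaml
- name: Cache GGUF models
  uses: actions/cache@v4
  with:
    path: ~/.cache/models
    key: gguf-models-${{ hashFiles('.github/workflows/llama.yml') }}

- name: Run inference
  run: |
    # Fall back to 0 GPU layers when NGL was not exported for this job.
    wasmedge --dir .:. --env n_gpu_layers="${NGL:-0}" wasmedge-ggml-qwen2vl.wasm
```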

Summary of changes

  • Key Changes:
    • Updated wasmedge version from ["0.13.5", "0.14.0"] to ["0.14.1"].
    • Updated plugin to [wasi_nn-ggml-b4419].
    • Added a new job for Qwen2-VL, including downloading models, building the project, and running inference on an image.

wasmedge-ggml/qwen2vl/Cargo.toml

Potential issues

  1. The edition field is set to "2021", which is in fact a valid Rust edition; TOML requires the value to be a quoted string, so edition = "2021" is correct as written and needs no change.
  2. The dependency wasmedge-wasi-nn = "0.7.1" might be outdated; consider checking for a newer version to benefit from bug fixes and improvements.
  3. The name field uses hyphens, which is allowed and common for Cargo package names; note that Cargo exposes the crate in code with hyphens mapped to underscores, i.e., as wasmedge_ggml_qwen2vl.
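
A hedged sketch of what the manifest might look like after addressing these points; the fields and versions are taken from this review's summary, and the versions should be pinned to whatever the project actually tests against:

```toml
[package]
name = "wasmedge-ggml-qwen2vl"
version = "0.1.0"
edition = "2021"

[dependencies]
serde_json = "1.0"
wasmedge-wasi-nn = "0.7.1"
```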

Summary of changes

  • Added a new Cargo.toml file for package configuration.
  • Defined the package name as "wasmedge-ggml-qwen2vl" and set its version to "0.1.0".
  • Specified dependencies including serde_json version "1.0" and wasmedge-wasi-nn version "0.7.1".

wasmedge-ggml/qwen2vl/src/main.rs

Potential issues

  1. Error Handling in get_options_from_env: The function uses unwrap() on the result of serde_json::from_str, which can panic if the JSON is malformed. Replace unwrap() with proper error handling to prevent panics.

  2. Tensor Data Size in set_data_to_context: The function does not check if the input data size matches the tensor shape, which could lead to unexpected behavior or errors. Consider adding a size validation check.

  3. Metadata Parsing in get_metadata_from_context: The function assumes that specific keys exist in the metadata JSON without checking, which can cause runtime errors if the keys are missing. Add key existence checks before accessing them.
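
Taken together, the three points above amount to replacing panics with Results and validating inputs before use. A minimal std-only sketch of the patterns; the function names and the HashMap metadata stand-in are illustrative, not the PR's actual API:

```rust
use std::collections::HashMap;

// 1. Return a Result instead of calling unwrap() on a fallible parse
//    (the same pattern applies to serde_json::from_str in the PR).
fn parse_ctx_size(raw: &str) -> Result<usize, String> {
    raw.parse::<usize>()
        .map_err(|e| format!("invalid ctx_size {:?}: {}", raw, e))
}

// 2. Validate that the input byte length matches the tensor shape
//    before handing the buffer to the inference context.
fn validate_tensor_size(shape: &[usize], data_len: usize, elem_size: usize) -> Result<(), String> {
    let expected: usize = shape.iter().product::<usize>() * elem_size;
    if expected == data_len {
        Ok(())
    } else {
        Err(format!("tensor expects {} bytes, got {}", expected, data_len))
    }
}

// 3. Look up metadata keys without assuming they exist: None instead of a panic.
fn metadata_usize(meta: &HashMap<String, String>, key: &str) -> Option<usize> {
    meta.get(key)?.parse().ok()
}
```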

Summary of changes

  • Key Changes:
    • Added Qwen2-VL Example Code: Introduced a new Rust program that integrates with the wasmedge_wasi_nn library to handle multimedia projection and image processing, specifically for the Qwen2-VL model.
    • Environment Variable Handling: Implemented functions to read required (mmproj, image) and optional environment variables (enable_log, ctx_size, n_gpu_layers) to configure the model's behavior and execution context dynamically.
    • Interactive Prompt Processing: Added interactive prompt handling where users can input text, and the system processes it through a Qwen2-VL model in an ongoing dialogue format, resetting the context when necessary if the prompt becomes too long or the context is full.
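
The context-reset behavior described above can be sketched as follows; the helper name, the character-count budget (a stand-in for real token counting), and the use of Qwen's <|im_start|> chat markers are illustrative assumptions, not the PR's exact code:

```rust
// Hypothetical sketch: append a user turn to the running dialogue,
// clearing the history first if it would overflow the context budget.
fn build_prompt(history: &mut String, user_input: &str, ctx_budget: usize) -> String {
    let turn = format!(
        "<|im_start|>user\n{}<|im_end|>\n<|im_start|>assistant\n",
        user_input
    );
    // Rough character-count budget as a stand-in for token counting.
    if history.len() + turn.len() >= ctx_budget {
        history.clear(); // reset the context when the dialogue grows too long
    }
    history.push_str(&turn);
    history.clone()
}
```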

wasmedge-ggml/qwen2vl/wasmedge-ggml-qwen2vl.wasm

Potential issues

  1. Memory Access Issue: The code uses hard-coded memory addresses and indices (the compiled binary's symbols are not human-readable), which can lead to undefined behavior if the data structure layout changes, making the code fragile and difficult to maintain.

  2. Magic Numbers: The presence of numerous unexplained numeric constants without clear definitions or comments makes it hard to understand their significance, increasing the risk of errors during maintenance.

  3. Lack of Error Checking: The code does not check for errors after critical function calls, which can lead to silent failures and make debugging more challenging.

Summary of changes

  • Added a new example configuration for Qwen2-VL.
  • Integrated necessary model-specific parameters and settings for Qwen2-VL.
  • Included sample code or instructions to run the Qwen2-VL example.

@dm4 dm4 marked this pull request as draft December 25, 2024 08:03
@dm4 dm4 marked this pull request as ready for review January 2, 2025 08:36