
Ramalama does not work with granite models #338

Closed
vpavlin opened this issue Oct 21, 2024 · 4 comments

Comments

@vpavlin

vpavlin commented Oct 21, 2024

Ollama announced support for IBM Granite https://x.com/ollama/status/1848223852465213703

I tried to run granite3-moe with ramalama

[vpavlin@vpavlin-tuxedo ~/devel/github.com/vpavlin/ramalama(main) ]
$ ramalama run granite3-moe
Pulling dfc8e4074962e215: 100% ▕####################▏ 1.92G/1.92G 3.23MB/s 00:00
[vpavlin@vpavlin-tuxedo ~/devel/github.com/vpavlin/ramalama(main) ]
$ ramalama run granite3-moe

But it exits after the download without printing any error log. The latest Ollama works fine with this model.

OS: Ubuntu 23.10
Python: 3.11.6
Ramalama:


$ ramalama info 
{
    "Engine": "podman",
    "Image": "quay.io/ramalama/ramalama:latest",
    "Runtime": "llama.cpp",
    "Store": "/home/vpavlin/.local/share/ramalama",
    "Version": 0
}
@ericcurtin
Collaborator

Fix:

#340

This is specifically a problem with granitemoe models.

@vpavlin
Author

vpavlin commented Oct 21, 2024

FYI, it's not just the moe models:

$ ramalama run granite3-dense
Pulling 629c1de9fdd794ce: 100% ▕####################▏ 1.49G/1.49G 5.51MB/s 00:00
[vpavlin@vpavlin-tuxedo ~/devel/github.com/vpavlin/ramalama(main) ]
$ ramalama run granite3-dense

@ericcurtin
Collaborator

It's likely the same fix: update llama.cpp. It's amazing how quickly the upstream llama.cpp project moves. If you apply this patch, at least for the granite-moe case you get this:

llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'granitemoe'

because this model architecture was only added to llama.cpp recently:

diff --git a/ramalama/model.py b/ramalama/model.py
index a12cc1e..f9b3fd3 100644
--- a/ramalama/model.py
+++ b/ramalama/model.py
@@ -120,7 +120,7 @@ class Model:
             exec_args.append("-cnv")

         try:
-            exec_cmd(exec_args, False)
+            exec_cmd(exec_args, True)
         except FileNotFoundError as e:
             if in_container():
                 raise NotImplementedError(file_not_found_in_container % (exec_args[0], str(e).strip("'")))

@rhatdan
Copy link
Member

rhatdan commented Oct 21, 2024

This should be fixed with release v0.0.20

@rhatdan rhatdan closed this as completed Oct 21, 2024