(statsd) idle slot metric not working with latest llama.cpp #19

Closed
bodybreaker opened this issue Aug 28, 2024 · 5 comments
@bodybreaker

The paddler_slots_idle metric is not working with the latest llama.cpp.

mcharytoniuk added the bug ("Something isn't working") label on Aug 28, 2024
mcharytoniuk self-assigned this on Aug 28, 2024
@us58

us58 commented Sep 1, 2024

The slot states were moved from the /health endpoint to a new /slots endpoint. See ggerganov/llama.cpp#9056.
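
As a rough illustration of what reading the new endpoint could involve, here is a minimal Python sketch that counts idle and busy slots from a /slots-style JSON array. The field names (`state`, with 0 taken to mean idle, and `is_processing`) are assumptions about the response shape and may differ between llama.cpp versions. `count_slots` is a hypothetical helper, not Paddler's actual implementation.

```python
import json


def count_slots(slots_json: str) -> tuple[int, int]:
    """Return (idle, processing) for a /slots-style JSON array.

    Assumption: each slot is an object carrying either a boolean
    `is_processing` field or an integer `state` field where 0 means idle.
    """
    slots = json.loads(slots_json)
    processing = 0
    for slot in slots:
        if "is_processing" in slot:
            busy = bool(slot["is_processing"])
        else:
            # Slots without a `state` field are treated as idle (assumption).
            busy = slot.get("state", 0) != 0
        if busy:
            processing += 1
    return len(slots) - processing, processing


# Hand-written sample payload, not captured from a real server.
sample = (
    '[{"id": 0, "state": 0},'
    ' {"id": 1, "state": 1},'
    ' {"id": 2, "is_processing": false}]'
)
idle, busy = count_slots(sample)
print(f"idle={idle} processing={busy}")
```

The helper takes the JSON text rather than a URL, so it can be checked against a saved response before pointing anything at a live server.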

@mcharytoniuk
Member

mcharytoniuk commented Sep 2, 2024

@us58 Thanks for letting me know. I am working on the update now.

I also raised the issue in the llama.cpp discussions, since such a breaking change was rather unexpected: ggerganov/llama.cpp#9276

@mcharytoniuk
Member

@bodybreaker @us58 I released a new version that uses the new /slots endpoint from the llama.cpp server: https://github.com/distantmagic/paddler/releases/tag/v0.7.0

Please let me know if that solved the StatsD issue.

@debrisapron

I think I might be having a problem related to this. In my llama.cpp logs I see the monitoring requests coming in from the paddler agent, but they are still hitting the /health endpoint instead of the /slots endpoint as they should. This is with release 0.8.0, so as I understand it this should be working?

@mcharytoniuk
Member

@debrisapron @bodybreaker I have just released a new stable version of Paddler (v1.0.0) and changed the CLI framework; overall, it underwent a total rewrite.

I think your issue should be solved now. If it still persists, feel free to reopen, but please check the README first, as some flag names have changed.
