
Healthcheck endpoint? #4746

Closed
JohnGalt1717 opened this issue Jan 3, 2024 · 3 comments · Fixed by #5548
Labels
enhancement New feature or request

Comments

JohnGalt1717 commented Jan 3, 2024

Prerequisites

Please answer the following questions for yourself before submitting an issue.

  - [x] I am running the latest code. Development is very rapid so there are no tagged versions as of now.
  - [x] I carefully followed the README.md.
  - [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  - [x] I reviewed the Discussions, and have a new bug or useful enhancement to share.

Feature Description

I'd like the server to have a /health endpoint.

Motivation

This is primarily for Docker: since llama.cpp will crash the process in many cases, a health check would let Docker detect the failure, restart the server automatically, and recover.

Possible Implementation

A GET /health endpoint that returns 200 when the server is healthy, and simply times out when the server isn't available. The best I can do right now is /props.

Also, the final Docker container needs to have curl or wget installed in it, and the documentation should be updated to show how to use docker-compose's healthcheck functionality for this.
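A minimal docker-compose sketch of what that could look like; the image name, port, and /health path are assumptions here, not something the project ships:

```yaml
# Hypothetical sketch: wire the requested /health endpoint into docker-compose's
# healthcheck so Docker restarts the container when the server stops responding.
# Requires curl to be present inside the image (see the comment below about the Dockerfile).
services:
  llama-server:
    image: ghcr.io/ggerganov/llama.cpp:server   # assumed image name
    ports:
      - "8080:8080"
    healthcheck:
      test: ["CMD", "curl", "--fail", "http://localhost:8080/health"]
      interval: 30s
      timeout: 5s
      retries: 3
    restart: unless-stopped
```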

I'd also like /completion (and its async variant) to return a 429 error instead of 404 when the server is busy: a 429 is easily retried, while a 404 means "not found" and is therefore terminal.
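A sketch of the client-side distinction being requested here (the function names and backoff values are illustrative, not part of any llama.cpp API): 429 signals "back off and retry", while 404 is terminal.

```python
# Hypothetical client-side retry policy: 429 (Too Many Requests) is transient,
# so a /completion request that gets one is worth retrying after a backoff;
# 404 means "not found" and should never be retried.
RETRYABLE_STATUSES = {429}

def should_retry(status: int) -> bool:
    """Return True if a request with this HTTP status code is worth retrying."""
    return status in RETRYABLE_STATUSES

def retry_delays(attempts: int, base: float = 0.5) -> list[float]:
    """Exponential backoff schedule (in seconds) a client could use between retries."""
    return [base * (2 ** i) for i in range(attempts)]
```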

JohnGalt1717 added the enhancement (New feature or request) label on Jan 3, 2024
Huge commented Jan 10, 2024

#4853 seems related, no clue why that PR is "closed".

Celarye commented Jan 11, 2024

These have been added through #4881 instead.

bsquizz (Contributor) commented Jul 25, 2024

There's an issue with llama-server.Dockerfile: curl is not installed in the final runtime layer, so the health check cannot be run from inside the image. PR open to fix it: #8693
