server : fill usage info in embeddings and rerank responses #17714

windows-latest-cmake (openblas-x64, -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGM...

succeeded Dec 16, 2024 in 5m 12s