[New Model]: NV-Embed-v2 #12137

Hypothesis-Z · 2025-01-17T02:54:17Z

It is the SOTA free model of text-embedding.

Not sure... maybe Mistral

The model requires prompt as input in model.encode and latent attantion mask in model.forward but the vLLM API does not support.

Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

The text was updated successfully, but these errors were encountered:

lycfight · 2025-01-17T09:10:09Z

Does vllm support the nvidia/NV-Embed-v2 model?

luisegarduno · 2025-01-19T17:08:57Z

Duplicate Issue:

Hypothesis-Z added the new model Requests to new models label Jan 17, 2025

Provide feedback