Introducing Rerank 3.5: Precise AI Search #963
Labels
ai-platform
model hosts and APIs
llm
Large Language Models
RAG
Retrieval Augmented Generation for LLMs
use-cases
user use case descriptions
Introducing Rerank 3.5: Precise AI Search
COHERE TEAM
DEC 02, 2024
Rerank 3.5 delivers improved reasoning and multilingual capabilities to search complex enterprise data with greater accuracy.
Key Contributors
Daniel Simig, Nabila Abraham, Clifton Poth, Martin Hentschel, Violet Dang, Minghan Li, Michael Peng, Nils Reimers, and Elliott Choi
Rerank 3.5 is our latest AI search foundation model. It enables businesses to significantly improve the relevancy of information surfaced within search and retrieval-augmented generation (RAG) systems.
Rerank 3.5 delivers state-of-the-art features, including:
Improve enterprise AI systems in minutes
Rerank 3.5 efficiently finds the most relevant business data to answer a user question. It accomplishes this through a method called "cross-encoding" where the model computes a relevance score for a business document in relation to a user question. This method enables highly accurate information understanding, exceeding traditional keyword and embedding search, and can be applied after an initial dense retrieval stage to ensure the answers surfaced to users are maximally precise.
Rerank 3.5 is compatible with any existing search system, can be implemented with just a few lines of code, and tends to have negligible impact to overall system latency. It is most frequently used within retrieval-augmented generation (RAG) applications to provide generative AI models, such as our Command R series, with real-time business context to inform their outputs.
This offers a means of increasing the reliability of AI systems, helping sophisticated businesses move past experimentation and into production deployments to accelerate data-driven decision making.
Understand complex data across 100+ languages
Search systems often fail to retrieve relevant information when users implicitly or explicitly express constraints on what they would like returned. We identified that this was partially due to traditional systems lacking the ability to reason. Rerank 3.5 shows substantial improvements in this area, understanding complex multifaceted questions that other search systems fail to answer.
Reasoning Datasets are adversarial datasets where the user bounds a semantic search with implicit and explicit criteria. Reasoning dataset is measured as P@1 out of 2.
This capability is particularly helpful for businesses operating within specialized industries such as finance, government, energy, manufacturing, and healthcare. For example, on a financial services dataset we curated to be generally representative for common use cases, Rerank 3.5 performance was +23.4% better than Hybrid Search and +30.8% better than BM25. We expect organizations in these industries to observe similar improvements when evaluating performance on their data.
Rerank 3.5 also offers industry-leading multilingual capabilities. It can search across data in 100+ languages, with state-of-the-art accuracy on the following 10 global business languages: Arabic, Chinese, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish.
When compared to our previous Rerank 3 model, Rerank 3.5 delivers a +26.4% improvement on cross-lingual search where the user query is in a different language than the documents being searched. This helps large organizations eliminate barriers to accessing information across geographies and teams.
Get started with Rerank 3.5
Rerank 3.5 is available today on Cohere's platform, Amazon Bedrock, and Amazon SageMaker. Our latest reranker is available for the first time on Amazon Bedrock through the new Rerank API (learn more). It will soon be available across additional cloud platforms.
Rerank 3.5 can also be deployed into any Virtual Private Cloud (VPC) or on-premise environment. To get started with Rerank 3.5 for your business, and learn more about options for large enterprise deployments, please contact our sales team.
Developers can find additional technical details in our documentation.
A notice to existing users
Users of older Rerank models (i.e.
rerank-english-v2.0
andrerank-multilingual-v2.0
) will need to migrate to a newer version of the model. We recommend that your usage is moved torerank-v3.5
. Fine-tuned models created from these base models are not affected by this deprecation. When accessing these models via Cohere's APIs you will receive a warning - migration must happen by 2025-03-31. We will be in touch with impacted users over the coming days. If you would like to read more about Cohere's deprecation policies, you can find more details in our documentation.rerank-english-v2.0
rerank-v3.5
rerank-multilingual-v2.0
rerank-v3.5
Suggested labels
None
The text was updated successfully, but these errors were encountered: