Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introducing Rerank 3.5: Precise AI Search #963

Open
1 task
ShellLM opened this issue Dec 23, 2024 · 1 comment
Open
1 task

Introducing Rerank 3.5: Precise AI Search #963

ShellLM opened this issue Dec 23, 2024 · 1 comment
Labels
ai-platform model hosts and APIs llm Large Language Models RAG Retrieval Augmented Generation for LLMs use-cases user use case descriptions

Comments

@ShellLM
Copy link
Collaborator

ShellLM commented Dec 23, 2024

Introducing Rerank 3.5: Precise AI Search

COHERE TEAM

DEC 02, 2024

Rerank 3.5 delivers improved reasoning and multilingual capabilities to search complex enterprise data with greater accuracy.

Key Contributors

Daniel Simig, Nabila Abraham, Clifton Poth, Martin Hentschel, Violet Dang, Minghan Li, Michael Peng, Nils Reimers, and Elliott Choi

Rerank 3.5 is our latest AI search foundation model. It enables businesses to significantly improve the relevancy of information surfaced within search and retrieval-augmented generation (RAG) systems.

Rerank 3.5 delivers state-of-the-art features, including:

  • Enhanced reasoning skills to enable understanding complex user questions which express various constraints that have traditionally challenged search systems.
  • Broad data compatibility to search long documents with rich and relevant metadata (e.g. emails, reports), semi-structured data (e.g. tables, JSON), and code.
  • Improved multilingual performance across 100+ languages and industry-leading in global business languages such as Arabic, French, Japanese, and Korean.

"Cohere is a key part of what makes Notion AI work. Their reranker gives us both the speed and quality we need, and it's consistently improving. It's been essential for getting our AI Connectors out the door quickly." - Simon Last, Cofounder & CTO, Notion

Improve enterprise AI systems in minutes

Rerank 3.5 efficiently finds the most relevant business data to answer a user question. It accomplishes this through a method called "cross-encoding" where the model computes a relevance score for a business document in relation to a user question. This method enables highly accurate information understanding, exceeding traditional keyword and embedding search, and can be applied after an initial dense retrieval stage to ensure the answers surfaced to users are maximally precise.

Rerank 3.5 is compatible with any existing search system, can be implemented with just a few lines of code, and tends to have negligible impact to overall system latency. It is most frequently used within retrieval-augmented generation (RAG) applications to provide generative AI models, such as our Command R series, with real-time business context to inform their outputs.

This offers a means of increasing the reliability of AI systems, helping sophisticated businesses move past experimentation and into production deployments to accelerate data-driven decision making.

Understand complex data across 100+ languages

Search systems often fail to retrieve relevant information when users implicitly or explicitly express constraints on what they would like returned. We identified that this was partially due to traditional systems lacking the ability to reason. Rerank 3.5 shows substantial improvements in this area, understanding complex multifaceted questions that other search systems fail to answer.

Reasoning Datasets are adversarial datasets where the user bounds a semantic search with implicit and explicit criteria. Reasoning dataset is measured as P@1 out of 2.

This capability is particularly helpful for businesses operating within specialized industries such as finance, government, energy, manufacturing, and healthcare. For example, on a financial services dataset we curated to be generally representative for common use cases, Rerank 3.5 performance was +23.4% better than Hybrid Search and +30.8% better than BM25. We expect organizations in these industries to observe similar improvements when evaluating performance on their data.

Rerank 3.5 also offers industry-leading multilingual capabilities. It can search across data in 100+ languages, with state-of-the-art accuracy on the following 10 global business languages: Arabic, Chinese, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish.

When compared to our previous Rerank 3 model, Rerank 3.5 delivers a +26.4% improvement on cross-lingual search where the user query is in a different language than the documents being searched. This helps large organizations eliminate barriers to accessing information across geographies and teams.

Get started with Rerank 3.5

Rerank 3.5 is available today on Cohere's platform, Amazon Bedrock, and Amazon SageMaker. Our latest reranker is available for the first time on Amazon Bedrock through the new Rerank API (learn more). It will soon be available across additional cloud platforms.

"We're excited to work with Cohere to bring their latest reranker search model to Amazon Bedrock. Cohere Rerank 3.5 delivers state-of-the-art search and retrieval capabilities and enables powerful multilingual RAG applications. Rerank 3.5 enhances search accuracy by reranking keyword and vector results, ensuring only the most relevant content reaches the model—improving both search precision and enterprise efficiency. Whether in finance, hospitality, retail, e-commerce or beyond, Cohere's Rerank 3.5 model in Amazon Bedrock enables organizations to improve how they surface and utilize critical information, delivering better responses while reducing both latency and costs."

  • Rahul Pathak, VP Data & AI GTM, AWS

Rerank 3.5 can also be deployed into any Virtual Private Cloud (VPC) or on-premise environment. To get started with Rerank 3.5 for your business, and learn more about options for large enterprise deployments, please contact our sales team.

Developers can find additional technical details in our documentation.

A notice to existing users

Users of older Rerank models (i.e. rerank-english-v2.0 and rerank-multilingual-v2.0) will need to migrate to a newer version of the model. We recommend that your usage is moved to rerank-v3.5. Fine-tuned models created from these base models are not affected by this deprecation. When accessing these models via Cohere's APIs you will receive a warning - migration must happen by 2025-03-31. We will be in touch with impacted users over the coming days. If you would like to read more about Cohere's deprecation policies, you can find more details in our documentation.

Shutdown Date Older Model New Recommended Model
2025-03-31 rerank-english-v2.0 rerank-v3.5
2025-03-31 rerank-multilingual-v2.0 rerank-v3.5

Suggested labels

None

@ShellLM ShellLM added ai-platform model hosts and APIs llm Large Language Models RAG Retrieval Augmented Generation for LLMs use-cases user use case descriptions labels Dec 23, 2024
@ShellLM
Copy link
Collaborator Author

ShellLM commented Dec 23, 2024

Related content

#759 similarity score: 0.9
#838 similarity score: 0.89
#934 similarity score: 0.89
#948 similarity score: 0.88
#729 similarity score: 0.87
#386 similarity score: 0.87

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ai-platform model hosts and APIs llm Large Language Models RAG Retrieval Augmented Generation for LLMs use-cases user use case descriptions
Projects
None yet
Development

No branches or pull requests

1 participant