Add global Rerank interface with Cohere Rerank model support #276

infinityrobot · 2025-07-07T04:02:42Z

Important

Note! This PR is branched off and is dependent on the Cohere Provider implementation in #275.
Reranking has been submitted as a separate contribution to allow for a meaningful review independent of the core Cohere Provider implementation. The diff here might be a bit complicated until the core Cohere implementation is approved.

What this does

Note

This PR adds support for reranking to RubyLLM, enabling semantic reranking of search results and document collections. The initial implementation supports Cohere's Rerank models.

Why add reranking?

Reranking is a standard step in retrieval-augmented generation (RAG) pipelines that bridges retrieval and generation workflows. It's not application-specific logic but rather a foundational communication pattern with LLMs that most users implementing search, recommendation, or RAG systems will need.

Reranking broadly serves multiple common use cases:

RAG – Improving context relevance before chat-based text generation
Search enhancement – Semantically ranking search results
Recommendation systems – Ordering items by relevance to user queries
Content curation – Ranking documents, articles, or responses by semantic similarity

Implementation overview

Reranking is implemented primarily in a new RubyLLM::Rerank class, which follows the same pattern as the existing RubyLLM::Embedding implementation.

Note

To facilitate easy reranking, a default_rerank_model attribute has been added to Configuration which is set to Cohere's rerank-v3.5.

The main entry point is RubyLLM.rerank(...) which delegates to RubyLLM::Rerank.rank(...). This method accepts:

query – The search query to evaluate documents against
documents – An array of text documents to rerank
Optional reranking parameters
- top_n – Top number of results to return
- max_tokens_per_doc – Max tokens per doc for chunking
Optional model parameters (as in embeddings and other implementations)
- model, provider, , context, assume_model_exists

The query, documents, and parameters are assembled into an API payload and submitted through the Provider Reranking interface (e.g., RubyLLM::Providers::Cohere::Reranking).
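For reviewers unfamiliar with the request shape, here is a minimal sketch of how such a payload could be assembled. It is not the code from this PR: `build_rerank_payload` is a hypothetical helper, and the field names are assumed to follow Cohere's v2 rerank request body.

```ruby
require 'json'

# Hypothetical helper (not taken from this PR): builds the body of a
# Cohere-style rerank request. Optional parameters left as nil are
# dropped so they are not sent to the API at all.
def build_rerank_payload(query, documents, model: 'rerank-v3.5', top_n: nil, max_tokens_per_doc: nil)
  {
    model: model,
    query: query,
    documents: documents,
    top_n: top_n,
    max_tokens_per_doc: max_tokens_per_doc
  }.compact
end

payload = build_rerank_payload(
  'How do I handle exceptions in Ruby?',
  ['Ruby uses begin/rescue/end blocks.', 'JavaScript has async/await.'],
  top_n: 1
)

puts JSON.generate(payload)
```

Dropping nil values with `compact` keeps the provider's own defaults in effect for any option the caller did not set.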

If successful, the response is parsed and a Rerank object is returned with:

model – The ID of the model used
results – The results of the reranking, returned as an array of RerankResult objects sorted by relevance_score (highest first), each containing:
- index – The original document index in documents
- relevance_score – Score between 0 and 1, with 1 being the most relevant
- document – The document content tested
search_units – The billable units returned by providers like Cohere, TogetherAI, etc.
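The mapping from a provider response to these objects can be sketched as follows. This is an illustration only, not the PR's implementation: the Struct definitions and `parse_rerank_response` are hypothetical stand-ins, and the response shape (`results` with `index` and `relevance_score`, plus `meta.billed_units.search_units`) is assumed to match Cohere's rerank response.

```ruby
# Stand-ins for the classes described above (assumed shapes, not the PR's code).
RerankResult = Struct.new(:index, :relevance_score, :document, keyword_init: true)
Rerank = Struct.new(:model, :results, :search_units, keyword_init: true)

# Hypothetical parser: turns a decoded JSON response into a Rerank object.
# Each result's index is used to look up the original document text.
def parse_rerank_response(body, model:, documents:)
  results = body.fetch('results').map do |item|
    RerankResult.new(
      index: item.fetch('index'),
      relevance_score: item.fetch('relevance_score'),
      document: documents[item.fetch('index')]
    )
  end

  Rerank.new(
    model: model,
    # Highest relevance first.
    results: results.sort_by { |result| -result.relevance_score },
    # nil when the provider does not report billed units.
    search_units: body.dig('meta', 'billed_units', 'search_units')
  )
end

documents = ['doc a', 'doc b', 'doc c']
body = {
  'results' => [
    { 'index' => 2, 'relevance_score' => 0.12 },
    { 'index' => 0, 'relevance_score' => 0.91 }
  ],
  'meta' => { 'billed_units' => { 'search_units' => 1 } }
}

rerank = parse_rerank_response(body, model: 'rerank-v3.5', documents: documents)
rerank.results.each do |result|
  puts format('%.2f  [%d] %s', result.relevance_score, result.index, result.document)
end
```

Using `dig` for the billed units means a provider that omits the `meta` block yields `nil` instead of raising.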

Usage example

# Configure your API key
RubyLLM.configure do |config|
  config.cohere_api_key = ENV['COHERE_API_KEY']
end

# Your search query
query = "How do I handle exceptions in Ruby?"

# Candidate documents to rerank
documents = [
  "Ruby uses begin/rescue/end blocks to handle exceptions, similar to try/catch in other languages.",
  "JavaScript async/await syntax makes handling asynchronous operations much easier.",
  "The raise keyword in Ruby allows you to throw custom exceptions with specific error messages.",
  "Python dictionaries are similar to Ruby hashes but use different syntax for iteration.",
  "Ruby's ensure block always executes, making it perfect for cleanup operations like closing files."
]

# Rerank the documents
RubyLLM.rerank(query, documents)
=>
#<RubyLLM::Rerank:0x0000000122cb0d68
 @model="rerank-v3.5",
 @results=
  [#<RubyLLM::RerankResult:0x0000000122cb0ed0
    @document="Ruby uses begin/rescue/end blocks to handle exceptions, similar to try/catch in other languages.",
    @index=0,
    @relevance_score=0.8865877>,
   #<RubyLLM::RerankResult:0x0000000122cb0e80
    @document="The raise keyword in Ruby allows you to throw custom exceptions with specific error messages.",
    @index=2,
    @relevance_score=0.63750535>,
   #<RubyLLM::RerankResult:0x0000000122cb0e58
    @document="Ruby's ensure block always executes, making it perfect for cleanup operations like closing files.",
    @index=4,
    @relevance_score=0.08234999>,
   #<RubyLLM::RerankResult:0x0000000122cb0e30
    @document="JavaScript async/await syntax makes handling asynchronous operations much easier.",
    @index=1,
    @relevance_score=0.031288605>,
   #<RubyLLM::RerankResult:0x0000000122cb0e08
    @document="Python dictionaries are similar to Ruby hashes but use different syntax for iteration.",
    @index=3,
    @relevance_score=0.027483363>],
 @search_units=1>
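As a follow-up to the output above, here is one way reranked results might feed a RAG prompt. It is a sketch under stated assumptions: `ScoredDocument` is a stand-in for RerankResult, `build_context` is a hypothetical helper, and the 0.5 threshold is an arbitrary illustration rather than a recommended value.

```ruby
# Stand-in for the RerankResult objects shown above (assumed readers:
# index, relevance_score, document).
ScoredDocument = Struct.new(:index, :relevance_score, :document)

# Hypothetical helper: keeps results scoring at or above min_score,
# takes at most `limit` of them, and joins their text into one context
# string. Expects results already sorted by relevance, highest first.
def build_context(results, min_score: 0.5, limit: 3)
  results
    .select { |result| result.relevance_score >= min_score }
    .first(limit)
    .map(&:document)
    .join("\n")
end

# The top three results from the example output above.
results = [
  ScoredDocument.new(0, 0.8865877, 'Ruby uses begin/rescue/end blocks to handle exceptions, similar to try/catch in other languages.'),
  ScoredDocument.new(2, 0.63750535, 'The raise keyword in Ruby allows you to throw custom exceptions with specific error messages.'),
  ScoredDocument.new(4, 0.08234999, "Ruby's ensure block always executes, making it perfect for cleanup operations like closing files.")
]

context = build_context(results)
puts context
```

With these scores, only the two documents about begin/rescue and raise pass the threshold; the ensure document (0.08) is left out of the context.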

Implementation notes

Follows existing RubyLLM patterns for model communication
Added detailed documentation in docs/guides/rerank.md
Includes comprehensive error handling and response validation
Supports all Cohere Rerank model variants (rerank-v3.5, rerank-english-v3.0, rerank-multilingual-v3.0, etc.)
Ready to support other reranking providers, such as existing Ollama models or new providers that offer a Rerank API (e.g., TogetherAI)
Maintains consistency with existing RubyLLM conventions
Supports context-specific implementations (e.g., rerank_context.rerank(...))
Full specs with VCRs

Type of change

Scope check

I read the Contributing Guide
This aligns with RubyLLM's focus on LLM communication
This isn't application-specific logic that belongs in user code
This benefits most users, not just my specific use case

Quality check

I ran overcommit --install and all hooks pass
I tested my changes thoroughly
I updated documentation if needed
I didn't modify auto-generated files manually (models.json, aliases.json)

API changes

Breaking change
New public methods/classes
Changed method signatures
No API changes

Related issues

No related issues.

infinityrobot added 18 commits July 7, 2025 13:07

Add Cohere API key configuration

f52d211

Add base Cohere Provider

9cef065

Add Cohere Capabilities and Models modules

552d894

Add initial Cohere Chat and Embeddings module implementations

42b18c3

Add Cohere model support to model and alias rake tasks

21bfcd4

Add Cohere models support to Chat and Vision model specs

84e17c3

Refactor Embeddings spec to use Constant and custom model dimensions

5aa271f

Add skipped specs for image embeddings to note for future support

eda8151

Skip Cohere vision model when using remote images in specs

b4fa7a8

Add Cohere API key config to documentation

f6b3511

Remove accidental extend of in progress Cohere::Reranking module

ddabbab

Update models.json

4ee3228

Update aliases.json

c8e659c

Update VCRs for Cohere model specs

0ecbec9

Fix Parsera nil model ID handling in models.rb

d41ba45

Update available model docs

8c57040

Remove empty line in Cohere models module

b5a6865

Remove Rerank reference from Cohere module comment

7de8301

infinityrobot mentioned this pull request Jul 7, 2025

Add Cohere Provider support #275

Open

17 tasks

infinityrobot added 11 commits July 8, 2025 19:21

Add Cohere logos to documentation

3aeaa5f

Merge remote-tracking branch 'upstream/main' into add-cohere-provider

fe9856d

Run rake models:update models:docs aliases:generate

209ae93

Update Cohere embeddings implementation for updates made in crmne#267

f0083c3

Rerun Cohere VCRs

0cded6f

Add reranking configuration

f91143a

Add core Rerank and RerankResult classes

2f73ae6

Add initial RubyLLM and Provider Rerank implementation

6160e3f

Add context-specific Rerank implementation

03019fc

Add Cohere reranking support to Cohere Provider

5bef4ba

Add and update reranking-related model helpers

ee30d9d

infinityrobot added 4 commits July 18, 2025 16:13

Add Rerank and RerankResult specs

4db2065

Add Rerank documentation

ebafce7

Add Rerank spec VCRs

8ca58f3

Update Rerank documentation

36669f1

infinityrobot force-pushed the add-cohere-reranking-support branch from 1e2ae2a to 36669f1 on July 18, 2025 06:13
