feat(ai): add how to query rerank models #5873

RoRoJ · 2025-11-26T13:34:42Z

Add doc for querying rerank models

pages/generative-apis/how-to/query-reranking-models.mdx

Co-authored-by: Benedikt Rollik <brollik@scaleway.com>

nerda-codes · 2025-11-27T11:03:35Z

pages/generative-apis/how-to/query-reranking-models.mdx

+
+For example: a query to a fast (but imprecise) model may return a list of 100 documents. A specialized reranking model can then evaluate these documents more deeply, score each on how well it matches the query, and return only the 10 most relevant documents to the first model to be used in answering the query.
+
+This approach takes advantage of the strengths of each model: one that is fast but not specialized, which can generate candidates quickly, and another than is slow but specialized, to refine these candidates. It can result in reduced context windows with therefore improved relevance, and faster overall query processing time.


Suggested change

This approach takes advantage of the strengths of each model: one that is fast but not specialized, which can generate candidates quickly, and another than is slow but specialized, to refine these candidates. It can result in reduced context windows with therefore improved relevance, and faster overall query processing time.

This approach takes advantage of the strengths of each model: one that is fast but not specialized, which can generate candidates quickly, and another that is slow but specialized, to refine these candidates. It can result in reduced context windows with therefore improved relevance, and faster overall query processing time.

nerda-codes · 2025-11-27T11:05:14Z

pages/generative-apis/how-to/query-reranking-models.mdx

+- Query vector: `qv = embedding(query`)
+- Document vector: `dv = embedding(document content)`


Suggested change

- Query vector: `qv = embedding(query`)

- Document vector: `dv = embedding(document content)`

- Query vector: `qv = embedding` (query)

- Document vector: `dv = embedding` (document content)

nerda-codes · 2025-11-27T11:05:28Z

pages/generative-apis/how-to/query-reranking-models.mdx

+- Document vector: `dv = embedding(document content)`
+- Relevance score: `score = (qv, dv)` (dot product)
+
+Therefore, if you're performing repeated relevance scoring, you can streamline your workflow as follows:


Suggested change

Therefore, if you're performing repeated relevance scoring, you can streamline your workflow as follows:

Therefore, if you are performing repeated relevance scoring, you can streamline your workflow as follows:

fpagny · 2025-11-28T16:38:59Z

Ok for me 👍
The API is not ready to be deployed yet in production, we still need a quick technical fix.
I'll update when it's ready to be merged.

RoRoJ added 3 commits November 25, 2025 18:06

feat(genapi): how to query reranking models

775d6eb

fix(genapis): started ammendmnets

b3055a3

feat(genapis): finished rerank stuff

cfe3e2e

RoRoJ added type: new content New pages or categories do not merge PR that shouldn't be merged before a specific date (eg release) priority: low Maintenance PRs that are not critical. labels Nov 26, 2025

bene2k1 reviewed Nov 26, 2025

View reviewed changes

pages/generative-apis/how-to/query-reranking-models.mdx Outdated Show resolved Hide resolved

bene2k1 approved these changes Nov 26, 2025

View reviewed changes

Apply suggestions from code review

68e59e1

Co-authored-by: Benedikt Rollik <brollik@scaleway.com>

nerda-codes reviewed Nov 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(ai): add how to query rerank models #5873

feat(ai): add how to query rerank models #5873

Uh oh!

RoRoJ commented Nov 26, 2025

Uh oh!

Uh oh!

nerda-codes Nov 27, 2025

Uh oh!

nerda-codes Nov 27, 2025

Uh oh!

nerda-codes Nov 27, 2025

Uh oh!

fpagny commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants


		For example: a query to a fast (but imprecise) model may return a list of 100 documents. A specialized reranking model can then evaluate these documents more deeply, score each on how well it matches the query, and return only the 10 most relevant documents to the first model to be used in answering the query.

		This approach takes advantage of the strengths of each model: one that is fast but not specialized, which can generate candidates quickly, and another than is slow but specialized, to refine these candidates. It can result in reduced context windows with therefore improved relevance, and faster overall query processing time.

		- Query vector: `qv = embedding(query`)
		- Document vector: `dv = embedding(document content)`

	Therefore, if you're performing repeated relevance scoring, you can streamline your workflow as follows:
	Therefore, if you are performing repeated relevance scoring, you can streamline your workflow as follows:

feat(ai): add how to query rerank models #5873

Are you sure you want to change the base?

feat(ai): add how to query rerank models #5873

Uh oh!

Conversation

RoRoJ commented Nov 26, 2025

Uh oh!

Uh oh!

nerda-codes Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

nerda-codes Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

nerda-codes Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

fpagny commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants