Speed up (filtered) KNN queries for flat vector fields #130251
For dense vector fields using the `flat` index, we already know a brute-force search will be used, so there's no need to go through the codec's approximate KNN logic. This change skips that step and builds the brute-force query directly, making things faster and simpler.

I tested this on a setup with **10 million random vectors**, each with **1596 dimensions** and **17,500 partitions**, using the `random_vector` track. The results:

### Performance Comparison

| Metric | Before | After | Change |
| ----------------- | --------- | ---------- | --------- |
| **Throughput** | 221 ops/s | 2762 ops/s | 🟢 +1149% |
| **Latency (p50)** | 29.2 ms | 1.6 ms | 🔻 -94.4% |
| **Latency (p99)** | 81.6 ms | 3.5 ms | 🔻 -95.7% |

Filtered KNN queries on flat vectors are now over 10x faster on my laptop!
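For readers unfamiliar with the flat path: since there is no graph to traverse, a filtered KNN search over a flat index reduces to scoring every document that passes the filter and keeping the top `k`. The following is a minimal, self-contained sketch of that idea in plain Java (the array layout, `filter` representation, and dot-product scoring are illustrative assumptions, not the actual Elasticsearch code):

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

// Conceptual sketch of brute-force filtered KNN over a flat index:
// score every filtered doc, keep the k best in a min-heap.
public class BruteForceKnn {

    record ScoredDoc(int docId, float score) {}

    static float dotProduct(float[] a, float[] b) {
        float sum = 0f;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // `vectors[doc]` is the stored vector for doc id `doc`; `filter[doc]`
    // marks docs that pass the pre-filter. Both names are hypothetical.
    static List<ScoredDoc> search(float[][] vectors, boolean[] filter, float[] query, int k) {
        // Min-heap holds the k best-scoring docs seen so far.
        PriorityQueue<ScoredDoc> heap =
            new PriorityQueue<>(Comparator.comparingDouble(ScoredDoc::score));
        for (int doc = 0; doc < vectors.length; doc++) {
            if (!filter[doc]) {
                continue; // brute force visits every doc but scores only filtered ones
            }
            float score = dotProduct(vectors[doc], query);
            if (heap.size() < k) {
                heap.add(new ScoredDoc(doc, score));
            } else if (score > heap.peek().score()) {
                heap.poll();
                heap.add(new ScoredDoc(doc, score));
            }
        }
        List<ScoredDoc> results = new ArrayList<>(heap);
        results.sort(Comparator.comparingDouble(ScoredDoc::score).reversed());
        return results;
    }
}
```

The point of the change is that when the index is flat, this exhaustive scan *is* the search, so routing through the codec's approximate-KNN entry point only adds overhead.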
Pinging @elastic/es-search-relevance (Team:Search Relevance)
Hi @jimczi, I've created a changelog YAML for you.
I am loving these numbers. Thank you for digging into this!
```java
Query knnQuery;
if (indexOptions != null && indexOptions.isFlat()) {
    knnQuery = filter == null
        ? createExactKnnBitQuery(queryVector)
        : new BooleanQuery.Builder().add(createExactKnnBitQuery(queryVector), BooleanClause.Occur.SHOULD)
            .add(filter, BooleanClause.Occur.FILTER)
            .build();
    if (parentFilter != null) {
        knnQuery = new ToParentBlockJoinQuery(knnQuery, parentFilter, ScoreMode.Max);
    }
} else {
    knnQuery = parentFilter != null
        ? new ESDiversifyingChildrenByteKnnVectorQuery(name(), queryVector, filter, k, numCands, parentFilter, searchStrategy)
        : new ESKnnByteVectorQuery(name(), queryVector, k, numCands, filter, searchStrategy);
}
```
This logic makes me think that `indexOptions` should satisfy some interfaces for creating queries... but I think that refactor can happen later.
```java
// Retrieve top rescoreK documents from the inner query
var topDocs = searcher.search(innerQuery, rescoreK);
vectorOperations = topDocs.totalHits.value();

// Retrieve top k documents from the top rescoreK query
var topDocsQuery = new KnnScoreDocQuery(topDocs.scoreDocs, searcher.getIndexReader());
var valueSource = new VectorSimilarityFloatValueSource(fieldName, floatTarget, vectorSimilarityFunction);
var rescoreQuery = new FunctionScoreQuery(topDocsQuery, valueSource);
var rescoreTopDocs = searcher.search(rescoreQuery.rewrite(searcher), k);
return new KnnScoreDocQuery(rescoreTopDocs.scoreDocs, searcher.getIndexReader());
```
We need this because the exact queries don't actually apply their own `k` limit; instead we must apply it with the searcher, right?
Just wanting to clarify: this looks very similar to what it was before, other than forcing `rescoreK` in the searcher.
Yep, it's a two-step reduction, since the first query is not a top-k query.
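To make the two-step reduction concrete, here is a minimal sketch in plain Java using precomputed score arrays (the names `approxScores`/`exactScores` and the array-based representation are illustrative assumptions, not the PR's Lucene-based code): first take `rescoreK` candidates by the cheap score, then re-rank only those candidates with the exact similarity and keep the final top `k`.

```java
import java.util.Comparator;
import java.util.List;
import java.util.stream.IntStream;

// Sketch of two-phase rescoring: the inner query is not a top-k query,
// so we reduce in two steps (top rescoreK, then exact top k).
public class TwoStepRescore {

    record Hit(int docId, float score) {}

    static List<Hit> rescore(float[] approxScores, float[] exactScores, int rescoreK, int k) {
        // Step 1: top rescoreK doc ids by the cheap/approximate score.
        List<Integer> candidates = IntStream.range(0, approxScores.length)
            .boxed()
            .sorted(Comparator.comparingDouble((Integer d) -> approxScores[d]).reversed())
            .limit(rescoreK)
            .toList();
        // Step 2: re-score only those candidates exactly and keep the top k.
        return candidates.stream()
            .map(d -> new Hit(d, exactScores[d]))
            .sorted(Comparator.comparingDouble(Hit::score).reversed())
            .limit(k)
            .toList();
    }
}
```

Note the trade-off this implies: a doc with a high exact score can still be missed if the approximate phase prunes it before rescoring, which is exactly why `rescoreK` is chosen larger than `k`.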
```java
if (parentFilter != null) {
    knnQuery = new ToParentBlockJoinQuery(knnQuery, parentFilter, ScoreMode.Max);
}
```
🤔 I gotta think about this one... `ESDiversifyingChildrenByteKnnVectorQuery` actually returns the child docs so that nested queries work both in top-level knn and when under a nested vector query.
If no tests fail here, I am thinking we don't actually have coverage for top-level and query-level nested knn over flat indices.
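For context on what the block-join wrapping with `ScoreMode.Max` does conceptually: in Lucene's block layout, each parent doc immediately follows its children, and with `ScoreMode.Max` the parent is scored with the maximum of its children's scores. A hedged sketch of that aggregation over a hypothetical flat-array layout (not Lucene's actual bitset-based implementation):

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of ScoreMode.Max parent/child score aggregation.
// Docs are laid out block by block as child*, parent; `isParent[doc]`
// marks parent docs, and parents' own slots in `scores` are unused.
public class ParentMaxJoin {

    // Returns {parentDocId, maxChildScore} pairs, one per parent block.
    // A parent with no children gets Float.NEGATIVE_INFINITY here.
    static List<float[]> joinToParents(float[] scores, boolean[] isParent) {
        List<float[]> parents = new ArrayList<>();
        float max = Float.NEGATIVE_INFINITY;
        for (int doc = 0; doc < scores.length; doc++) {
            if (isParent[doc]) {
                parents.add(new float[] { doc, max });
                max = Float.NEGATIVE_INFINITY; // reset for the next block
            } else {
                max = Math.max(max, scores[doc]);
            }
        }
        return parents;
    }
}
```

This is also why the coverage concern above matters: a query that returns parent docs (as the join does) behaves differently from one that returns the matching child docs, and only tests exercising both shapes over flat indices would catch a mix-up.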
I am writing up some yaml tests now to add to main to cover flat & nested
Right, good catch. I pushed a fix to remain at the nested level, but I realise now that rescoring in this mode is different: we'll rescore non-diversified children, so we need to apply collapsing or something similar. I'll think more about it while you're adding more tests, thanks!