-
Notifications
You must be signed in to change notification settings - Fork 144
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Backport 2.x] Lucene Based k-NN search support - merge feature branch #486
[Backport 2.x] Lucene Based k-NN search support - merge feature branch #486
Conversation
Adds lucene as another k-NN engine that can be used. Adds one supported method to the engine, "hnsw", with parameters "ef_construction" and "m". In addition, adds some layers of abstraction to reduce code depuplication. Signed-off-by: John Mazanec <jmazane@amazon.com> (cherry picked from commit 7a76d11)
…project#454) Adds support for Lucene's KnnVectorQuery type in KNNQueryBuilder. KNNQueryBuilder logic was enhanced to detect when an engine does not need to use KNNQuery type and build the Lucene KnnVectorQuery instead. Signed-off-by: John Mazanec <jmazane@amazon.com> (cherry picked from commit 67a1bce)
…h-project#456) * Add field mapper and per field format Signed-off-by: Martin Gaievski <gaievski@amazon.com> (cherry picked from commit 1fb8047)
…-project#458) * Refactor per field format to a class, add tests Signed-off-by: Martin Gaievski <gaievski@amazon.com> (cherry picked from commit 77d3b2e)
Signed-off-by: Martin Gaievski <gaievski@amazon.com> (cherry picked from commit 39a07dd)
Adds query integration tests for the lucene engine. Moves spacetype to lucene vectorsimilarityfunction translation to SpaceType enum for testability. Makes fields in KNNMethodContext non-null. Signed-off-by: John Mazanec <jmazane@amazon.com> (cherry picked from commit 37bdc74)
…rch-project#483) Signed-off-by: Martin Gaievski <gaievski@amazon.com> (cherry picked from commit 8624725)
Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Codecov Report
@@ Coverage Diff @@
## 2.x #486 +/- ##
============================================
- Coverage 84.09% 84.06% -0.04%
- Complexity 979 1019 +40
============================================
Files 140 146 +6
Lines 4031 4186 +155
Branches 362 373 +11
============================================
+ Hits 3390 3519 +129
- Misses 475 491 +16
- Partials 166 176 +10
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
for (int j = 0; j < k; j++) { | ||
float[] primitiveArray = Floats.toArray(Arrays.stream(knnResults.get(j).getVector()).collect(Collectors.toList())); | ||
float distance = TestUtils.computeDistFromSpaceType(spaceType, primitiveArray, queryVector); | ||
float rawScore = VECTOR_SIMILARITY_TO_SCORE.get(spaceType.getVectorSimilarityFunction()).apply(distance); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this change is required to address breaking change in Lucene 9.3. Method convertToScore
existis in Lucene 9.2 but has been removed in Lucene 9.3
Description
Backport of #485 to 2.x, need to do it manually as CI bot cannot do multiple individual commits.
Issues Resolved
#380
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.