Lucene multiple indexes: normalizing document scores?

Suppose I have several Lucene indexes (not replicas) on multiple machines.

I query each index and then combine the results. Is there a way to normalize the document scores so that I can sort the combined results by relevance?

I mean, the score for document A from index A will not be comparable to the score for document B from index B unless I do some kind of normalization, is that right?

Thanks, Roey

+3
2 answers

Lucene. , , - (idf).
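To make the idf point concrete, here is a minimal sketch, assuming Lucene's classic TF-IDF formula idf = 1 + ln(numDocs / (docFreq + 1)). Newer Lucene versions default to BM25, whose idf has a different form but also depends on per-index statistics. The index sizes and document frequencies below are made-up numbers for illustration only.

```python
import math


def classic_idf(num_docs: int, doc_freq: int) -> float:
    """Classic Lucene-style idf: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical statistics: the same term is rare in index A
# and common in index B, although both hold 1000 documents.
idf_a = classic_idf(num_docs=1000, doc_freq=9)    # 1 + ln(100)
idf_b = classic_idf(num_docs=1000, doc_freq=499)  # 1 + ln(2)

print(f"idf in index A: {idf_a:.3f}")
print(f"idf in index B: {idf_b:.3f}")
```

Because idf feeds into every score, the same term contributes a different weight in each index, which is why raw scores from separate indexes are not directly comparable.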

Luke , idfs . , .

idf, - - Solr. , .

Lucene . , , .

+4

Take a document A and two indexes X and Y. Define:

x = score(A,X) / (max score of any document that is a hit for the search on index X)
y = score(A,Y) / (max score of any document that is a hit for the search on index Y)

Both x and y lie between 0 and 1, so x and y can be compared.
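The max-score normalization described above can be sketched as follows. This is only an illustration in plain Python, not Lucene API code: the function names `normalize` and `merge`, the index names, and the scores are all hypothetical.

```python
from typing import Dict, List, Tuple


def normalize(hits: Dict[str, float]) -> Dict[str, float]:
    """Divide every score by the top score of the same result set."""
    if not hits:
        return {}
    top = max(hits.values())
    if top <= 0:
        # Degenerate case: avoid dividing by zero.
        return {doc_id: 0.0 for doc_id in hits}
    return {doc_id: score / top for doc_id, score in hits.items()}


def merge(per_index_hits: Dict[str, Dict[str, float]]) -> List[Tuple[str, str, float]]:
    """Normalize each index's hits, then sort all hits by normalized score."""
    merged = []
    for index_name, hits in per_index_hits.items():
        for doc_id, score in normalize(hits).items():
            merged.append((index_name, doc_id, score))
    merged.sort(key=lambda hit: hit[2], reverse=True)
    return merged


# Hypothetical raw scores returned by two separate indexes.
results = merge({
    "X": {"A": 4.0, "C": 2.0},
    "Y": {"B": 0.9, "D": 0.3},
})
for index_name, doc_id, score in results:
    print(index_name, doc_id, round(score, 3))
```

Note that under this scheme the best hit of every index always gets a normalized score of 1, even if it is only a weak match for the query, so the merged ordering is a heuristic rather than a true global ranking.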

. .

, . ?

-1

Source: https://habr.com/ru/post/1712871/

