This video is a talk by Shay Banon, the creator of ElasticSearch, a distributed full-text search engine. In it, he discusses the pros and cons of term-based partitioning and document-based partitioning.
Basically, term-based partitioning generates too much network traffic between processes/nodes, and it is harder to implement cleanly. Document-based partitioning is much simpler to implement and to get results from.
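To make the difference concrete, here is a minimal toy sketch in Python. It is my own illustration, not code from the talk or from ElasticSearch; the corpus, the shard count, the function names, and the "postings shipped" counter are all assumptions made up for the example. It builds the same tiny index both ways and counts how many document ids have to leave the shards to answer an AND query.

```python
from collections import defaultdict

# Hypothetical toy corpus and shard count, chosen only for illustration.
DOCS = {
    0: "distributed search engine",
    1: "full text search",
    2: "distributed full text search engine",
    3: "text editor",
}
NUM_SHARDS = 2


def build_index(docs):
    """Map each term to the sorted list of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def document_partition(docs, num_shards):
    """Each shard holds a complete inverted index over its own documents."""
    shards = [{} for _ in range(num_shards)]
    for doc_id, text in docs.items():
        shards[doc_id % num_shards][doc_id] = text
    return [build_index(shard) for shard in shards]


def term_partition(docs, num_shards):
    """Each shard holds the full posting lists for a subset of the terms."""
    shards = [{} for _ in range(num_shards)]
    for term, postings in build_index(docs).items():
        # Sum of code points: a deterministic stand-in for a real hash.
        shards[sum(map(ord, term)) % num_shards][term] = postings
    return shards


def query_document_partitioned(shards, terms):
    """Every shard intersects locally and ships only its own matches."""
    hits, shipped = [], 0
    for index in shards:
        local = set.intersection(*(set(index.get(t, [])) for t in terms))
        shipped += len(local)
        hits.extend(local)
    return sorted(hits), shipped


def query_term_partitioned(shards, terms):
    """Whole posting lists are pulled to a coordinator and intersected there.

    Simplification: every list counts as shipped, even if two terms
    happen to live on the same shard.
    """
    lists, shipped = [], 0
    for term in terms:
        postings = next((s[term] for s in shards if term in s), [])
        shipped += len(postings)
        lists.append(set(postings))
    return sorted(set.intersection(*lists)), shipped


query = ["text", "search"]
print(query_document_partitioned(document_partition(DOCS, NUM_SHARDS), query))
# -> ([1, 2], 2)
print(query_term_partitioned(term_partition(DOCS, NUM_SHARDS), query))
# -> ([1, 2], 6)
```

Both layouts return the same documents, but in this toy setup the term-partitioned query moves entire posting lists while the document-partitioned one moves only the final matches. On the other hand, the document-partitioned query has to touch every shard, which is the usual price of that scheme.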
In addition, in this lecture, Jeffrey Dean also explains the differences and says that Google uses document-based partitioning.
These are the two main ways to distribute a search engine; I don't know of any others. In any case, you may want to look through the information retrieval literature for newer work on this subject.