Creating indices in Solr on top of HBase

Is there any way I can create indexes in Solr for full-text search over data stored in HBase, in near real time (NRT)?

I do not want to store the full text in my Solr indexes, so I set stored="false" on those fields.
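With stored="false", Solr can match documents but cannot return their text, so a search typically returns only document ids and the full text is then read from HBase. A minimal Python sketch of the Solr side of that lookup follows. It assumes the Solr `id` field holds the HBase row key; the host, the collection name `hbase_collection`, and the field name `body` are hypothetical.

```python
import json
from urllib.parse import urlencode


def build_select_url(base_url, collection, text_query, rows=10):
    """Build a Solr /select URL that asks only for the `id` field."""
    params = {"q": text_query, "fl": "id", "rows": rows, "wt": "json"}
    return f"{base_url}/{collection}/select?{urlencode(params)}"


def extract_row_keys(response_body):
    """Return the document ids (assumed here to be HBase row keys)."""
    payload = json.loads(response_body)
    return [doc["id"] for doc in payload["response"]["docs"]]


# Hand-written sample in the shape Solr returns for wt=json (not real data).
sample = (
    '{"response": {"numFound": 2, "start": 0, '
    '"docs": [{"id": "row-001"}, {"id": "row-042"}]}}'
)

# Hypothetical host, collection, and field name.
url = build_select_url("http://localhost:8983/solr", "hbase_collection", "body:hbase")
row_keys = extract_row_keys(sample)
print(url)
print(row_keys)
```

The returned row keys would then be used for point lookups in HBase to fetch the actual text, which keeps the Solr index small.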

Note: keep in mind that I am working with large data sets and want to perform real-time search. We are talking about TB/PB of data.

UPDATED

Cloudera distribution: 5.4.x, used with the Cloudera Search components.

Solr: 4.10.x

HBase: 1.0.x

Indexing service: Lily HBase Indexer with Cloudera Morphlines

Are there any other NRT indexer services or frameworks that can be used instead of Lily on Cloudera? I am also open to other options.

1 answer

Cloudera: please see this article and "HBase-Solr using Cloudera Search", which describe how to achieve this. See also the screenshot in those articles, which gives a bird's-eye view of the HBase-Solr integration. Check the known issues with Cloudera Search as well.

Yes, you can consider Morphlines. They can be used for both real-time applications and batch processing.
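Conceptually, the indexing step turns each changed HBase row into a Solr document by mapping columns to fields. The Python sketch below only illustrates that mapping idea; it is not the Morphlines or Lily API. The keys `inputColumn` and `outputField` are borrowed for readability from the mapping style of the Morphlines `extractHBaseCells` command, and the column and field names are hypothetical.

```python
def row_to_solr_doc(row_key, cells, mappings):
    """Map selected HBase cells of one row to a flat Solr document.

    cells maps "family:qualifier" strings to raw byte values.
    mappings is a list of {"inputColumn": ..., "outputField": ...} dicts.
    """
    doc = {"id": row_key}  # assumption: the row key becomes the Solr id
    for mapping in mappings:
        value = cells.get(mapping["inputColumn"])
        if value is not None:  # skip columns missing from this row
            doc[mapping["outputField"]] = value.decode("utf-8")
    return doc


# Hypothetical mapping: index content:body as `body`, content:title as `title`.
mappings = [
    {"inputColumn": "content:body", "outputField": "body"},
    {"inputColumn": "content:title", "outputField": "title"},
]
cells = {"content:body": b"full text to index", "meta:size": b"18"}

doc = row_to_solr_doc("row-001", cells, mappings)
print(doc)
```

Columns without a mapping (here `meta:size`) are simply not indexed, which is one way to keep the index limited to searchable text.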

I do not know much about the Hortonworks platform or how to do this there.


Source: https://habr.com/ru/post/1234829/

