Integration of Solr 6 and Nutch 2.3.1

According to Nutch news, the latest version of Nutch 2.3.1 is compatible with Solr 4.10.3, which is a very old version of solr.

Is it possible to integrate Solr 6 with Nutch 2.3.1. What will be the disadvantages if solr 6 is integrated? Has anyone tried this?

+4
source share
1 answer

This is an old question, but I just got Nutch 1.12 talking to Solr 6.3.0. The required / solrconfig schema changes should be the same for Nutch 2.x, so here is what I did:

Download and extract both products to a directory, for example. ~ / mycrawler, then go to the solr directory and create the kernel for nutch:

solr-6.3.0/bin $ ./solr start
solr-6.3.0/bin $ ./solr create_core -c nutch -d basic_configs
solr-6.3.0/bin $ ./solr stop

solr-6.3.0/server/solr/nutch, .. schema.xml:

solr-6.3.0/server/solr/nutch/conf $ rm managed-schema
solr-6.3.0/server/solr/nutch/conf $ cp ~/mycrawler/apache-nutch-1.12/conf/schema.xml .

schema.xml enablePositionIncrements="true" <filter class="solr.StopFilterFactory" ignoreCase="true" ....

solr-6.3.0/server/solr/nutch/conf/solrconfig.xml TypeMapping, :

<processor class="solr.AddSchemaFieldsUpdateProcessorFactory">
  <str name="defaultFieldType">strings</str>
    <!--
  <lst name="typeMapping">
    <str name="valueClass">java.lang.Boolean</str>
    <str name="fieldType">booleans</str>
  </lst>
  <lst name="typeMapping">
    <str name="valueClass">java.util.Date</str>
    <str name="fieldType">tdates</str>
  </lst>
  <lst name="typeMapping">
    <str name="valueClass">java.lang.Long</str>
    <str name="valueClass">java.lang.Integer</str>
    <str name="fieldType">tlongs</str>
  </lst>
  <lst name="typeMapping">
    <str name="valueClass">java.lang.Number</str>
    <str name="fieldType">tdoubles</str>
  </lst>
    -->
</processor>

:

solr-6.3.0/bin $ ./solr start

gui, ​​ - .

script Solr (, , Nutch 2):

./crawl -i \
    -D solr.server.url=http://localhost:8983/solr/nutch \ 
    ~/mycrawler/nutch_work/seed \
    ~/mycrawler/nutch_work/crawl  \
    1
+9

Source: https://habr.com/ru/post/1648773/


All Articles