How to use Apache Nutch through a Java application?

I need to build a search engine in Java / Java EE using Apache Nutch. I searched the Internet and found many articles on installing Apache Nutch, but no article or tutorial showing how to drive or manage a Nutch crawl from a Java program.

+4
3 answers

Apache Nutch only crawls the data; you still need to index what it finds in a search server. This is where Apache Solr comes in. You can then write a Java application that talks to Solr using SolrJ.
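To make the query side concrete, here is a minimal sketch that uses only the JDK (Java 11+), so it runs without extra dependencies. It calls Solr's HTTP `/select` handler directly, which is the same endpoint SolrJ talks to. The base URL `http://localhost:8983/solr`, the core name `nutch`, and the class and method names are all assumptions for illustration; adjust them to your setup.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; not part of Nutch, Solr, or SolrJ.
final class SolrSearchSketch {

    // Builds the URI for Solr's /select handler.
    // solrBaseUrl and core are assumptions about your installation.
    static URI buildSelectUri(String solrBaseUrl, String core, String userQuery, int rows) {
        // URL-encode the user's query so spaces and special characters are safe.
        String q = URLEncoder.encode(userQuery, StandardCharsets.UTF_8);
        return URI.create(solrBaseUrl + "/" + core + "/select?q=" + q
                + "&rows=" + rows + "&wt=json");
    }

    // Sends the query and returns the raw JSON response body as a string.
    static String search(URI selectUri) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(selectUri).GET().build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        return response.body();
    }

    public static void main(String[] args) throws InterruptedException {
        // Assumed local Solr instance with a core named "nutch".
        URI uri = buildSelectUri("http://localhost:8983/solr", "nutch", "apache nutch", 10);
        System.out.println("Querying " + uri);
        try {
            System.out.println(search(uri));
        } catch (IOException e) {
            // Reached when no Solr server is listening at the assumed address.
            System.out.println("Solr not reachable: " + e);
        }
    }
}
```

In a real application you would probably prefer SolrJ, which gives you typed query and response objects instead of raw JSON, but the request it sends is essentially the one built above.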

See this thread for a discussion of which versions of Nutch and Solr work best together.

+1

You can see an example in my GitHub repository: https://github.com/yegor256/nutch-in-java. There I run Nutch directly from a Java application, without going through the command line.

0

Nutch will be your crawling backend. You then index the crawled pages into Solr, and your front end queries that Solr index. See this link: http://www.building-blocks.com/thinking/building-a-search-engine-with-nutch-and-solr-in-10-minutes

-3

Source: https://habr.com/ru/post/1486175/

