Create your own java class that manages the Nutch loop. It should look like org.apache.nutch.crawl.Crawl, but you will need to replace the indexer call with the call to your Mysql connector. Or, you can call your Mysql connector during each cycle, depending on whether you want to update Mysql at the end of the crawl or during its execution.
source
share