I have a crawl site that includes some links to PDF files. I want nutch to scan this link and dump them as .pdf files. I use Apache Nutch1.6, also I do it in java as
ToolRunner.run(NutchConfiguration.create(), new Crawl(), tokenize(crawlArg)); SegmentReader.main(tokenize(dumpArg));
can someone help me on this
source share