nutch crawling stops after injector.

2019-09-01 02:37发布

问题:

here is my cygwin screen looks...

cygpath: can't convert empty path
Injector: starting at 2014-05-15 16:57:50
Injector: crawlDb: -dir/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Patch for HADOOP-7682: Instantiating workaround file system
Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0
Injector: Merging injected urls into crawl db.
Injector: overwrite: false
Injector: update: false
Injector: finished at 2014-05-15 16:57:52, elapsed: 00:00:02

回答1:

Total number of urls injected is 0. That is nothing to crawl.

Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0