run nutch2.3.1 on hadoop2

2019-07-25 22:07发布

站内文章 / 前沿技术

26 0

女 | 书童

私信

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I want to run nutch2.3.1 to crawl data on hadoop2. I have 3 nodes for hadoop2:

I deployed nutch2.3.1 to crawler1 and run it with following command: /usr/local/nutch/deploy/bin/crawl hdfs://xxx.xxx.xxx.xxx/urls/seed.txt test 5

It works and can crawl data ,but it looks like the crawl job only run on crawler1, the others nodes did not do any job for nutch.

my questions are:

Sorry for my poor English, I really appreciate any help you can provide.

标签： hadoop nutch

傲

女 | 书童

私信

Ta的文章更多文章

0条评论

还没有人评论过~