Combiner without Reducer in Hadoop

2019-07-04 13:02发布

Can I write a Hadoop code that has only Mappers and Combiners (i.e. mini-reducers with no reducer)?

job.setMapperClass(WordCountMapper.class);
job.setCombinerClass(WordCountReducer.class);

conf.setInt("mapred.reduce.tasks", 0);

I was trying to do so but I always see that I have one reduce task on the job tracker link

Launched reduce tasks = 1

How can I delete reducers while keeping combiners? is that possible?

标签： hadoop mapreduce

2条回答

再贱就再见

2楼-- · 2019-07-04 13:19

You need to tell your job that you don't care about the reducer: JobConf.html#setNumReduceTasks(int)

// new Hadoop API
jobConf.setNumReduceTasks(0);

// old Hadoop API
job.setNumReduceTasks(0);

You can achieve the something with IdentityReducer.

Performs no reduction, writing all input values directly to the output.

I'm not sure whether you can keep combiners but I will start with the previous lines.

0人赞添加讨论(0) 举报

劳资没心，怎么记你

3楼-- · 2019-07-04 13:34

In the case you describe you should use Reducers. Use as key: Context.getInputSplit().getPath() + Context.getInputSplit().getStart() - this combination is unique for each Mapper.

0人赞添加讨论(0) 举报

Combiner without Reducer in Hadoop

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间