As we know that Hadoop
guarantees that the same keys which come from different mappers
will be sent to the same reducer
.
But if two different keys have the same hash value, they definitely will go to the same reducer
, so will them be sent to the same reduce
method orderly? Which part is responsible for this logic?
Thanks a lot!
Q1:
so will them be sent to the same reduce method orderly
Ans : yes
Q2:
Which part is responsible for this logic
Ans : shuffle sort
Example :
Lets say number of reducer is 5 so now .