1) What is Hadoop Streaming?
Hadoop streaming is a utility which allows
you to create and run map/reduce job. It is a generic API that allows programs
written in any languages to be used as Hadoop mapper
2) What is a combiner in Hadoop?
A Combiner is a mini-reduce process which
operates only on data generated by a Mapper. When Mapper emits the data,
combiner receives it as input and sends the output to reducer.
Comments
Post a Comment