What is difference between a MapReduce InputSplit and HDFS block
What is a "reducer" in Hadoop?
What is a RecordReader in Hadoop MapReduce?
What is the role of recordreader in hadoop mapreduce?
What happens when a datanode fails ?
What is the utility of using Writable Comparable Custom Class in Map Reduce code?
How many Reducers run for a MapReduce job in Hadoop?
What is the purpose of textinputformat?
what happens in textinformat ?
How to optimize MapReduce Job?
What is shuffling and sorting in Hadoop MapReduce?
What is reduce side join in mapreduce?
What is the data storage component used by Hadoop?
Is it possible to split 100 lines of input as a single split in MapReduce?
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.