what does the text input format do?
Why Mapper runs in heavy weight process and not in a thread in MapReduce?
What is identity mapper and chain mapper?
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
What is the need of key-value pair to process the data in MapReduce?
Is reduce-only job possible in Hadoop MapReduce?
In Map Reduce why map write output to Local Disk instead of HDFS?
A number of combiners can be changed or not in MapReduce?
What is streaming?
How will you submit extra files or data ( like jars, static files, etc. ) For a mapreduce job during runtime?
Define Writable data types in MapReduce?
What is a "reducer" in Hadoop?
How to write a custom partitioner for a Hadoop MapReduce job?
What are the configuration parameters in the 'MapReduce' program?
What is a IdentityMapper and IdentityReducer in MapReduce ?