what are the main configuration parameters that user need to specify to run Mapreduce Job ?
What is the inputsplit in map reduce software?
Which are the methods in the mapper interface?
Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?
What is SequenceFileInputFormat?
What is the relationship between Job and Task in Hadoop?
In Hadoop, which file controls reporting in Hadoop?
What are the the issues associated with the map and reduce slots based mechanism in mapReduce?
Explain the differences between a combiner and reducer
What is RecordReader in a Map Reduce?
Explain what is distributed cache in mapreduce framework?
What is Data Locality in Hadoop?
How does Mappers run method works?