What is the next step after the Mapper or MapTask?
What is the Identity Reducer?
What is Rack Awareness? Why is it needed in Hadoop?
What are the different commands used to start up and shut down Hadoop daemons?
What is Distributed Cache in Hadoop?
Explain the usage of the Context object.
What happens when the NameNode enters safe mode in Hadoop?
After the Map phase finishes, the Hadoop framework performs partitioning, shuffle, and sort. Explain what happens in this phase.
What are the side effects of not running a Secondary NameNode?
What are the restrictions on the key and value classes?
Define a UDF (user-defined function).
How do you override the replication factor?
What happens if the number of reducers is set to 0 in Hadoop?
Explain what storage nodes and compute nodes are.
Describe the distributed cache in Hadoop.