What is the difference between namenode and datanode in hadoop?
Ideally what should be the replication factor in hadoop?
What is rack awareness in hadoop?
While starting hadoop services, datanode service is not running?
what happens when namenode goes down during file read operation in hadoop?
Why slaves limited to 4000 in hadoop version 1?
What is heartbeat in hadoop?
Explain small file problem in hadoop
How to create the directory when name node is in safe mode?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?
Ideally what should be the block size in hadoop?
What happen when namenode enters in safemode in hadoop?
What is shuffleing in mapreduce?
Does mapreduce programming model provide a way for reducers to communicate with each other? In a mapreduce job can a reducer communicate with another reducer?
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.