After the Map phase finishes, the Hadoop framework performs partitioning, shuffle, and sort. Explain what happens in this phase.
Hadoop achieves parallelism by dividing tasks across many nodes, so a few slow nodes can rate-limit the rest of the program and slow it down. What mechanism does Hadoop provide to combat this?
Hadoop General Questions
What benefits does YARN bring to Hadoop?
What is the block size in Hadoop?
Explain what the JobTracker is in Hadoop. What actions does Hadoop follow?
Is it possible to provide multiple inputs to Hadoop? If so, how?
What is meant by streaming access?
How does the NameNode handle DataNode failures?
Explain the usage of the Context object.
What happens if one Hadoop client renames a file, or a directory containing that file, while another client is still writing to it?
What is the command to change the replication factor?
What is the basic difference between a traditional RDBMS and Hadoop?
Which object can be used to get the progress of a particular job?
Explain what WebDAV is in Hadoop.
List some common problems faced by data analysts.
Are the JobTracker and TaskTrackers present on separate machines?