what is a datanode?
Explain the features of stand alone (local) mode?
Name some companies that use Hadoop?
What is HDFS block size and what did you chose in your project?
Explain the features of fully distributed mode?
Explain why do we need hadoop?
How many instances of tasktracker run on a hadoop cluster?
What is the use of combiners in the hadoop framework?
What stored in HDFS?
What is Apache Hadoop?
Define a metadata?
How hdfa differs with nfs?
Who invented hadoop?
What do shuffling do?
Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do?