How many JVMs run on a slave node?
What problems have you faced when you are working on Hadoop code?
What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?
Is hadoop a database?
What is the best hardware configuration to run Hadoop?
Is hadoop obsolete?
What are the benefits of block transfer?
Define streaming?
Explain the features of pseudo mode?
Define fault tolerance?
What is the purpose of DataNode block scanner?
What is a Combiner?
Explain how do ‘map’ and ‘reduce’ works?