How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
How client application interacts with the NameNode?
Explain the basic architecture of Hadoop?
What is partioner in hadoop? Where does it run,mapper or reducer?
What should be the ideal replication factor in Hadoop Cluster?
Explain the overview of hadoop history breifly?
How many instances of a jobtracker run on hadoop cluster?
How to resolve small file problem in hdfs?
Is hadoop the future?
What are the benefits of block transfer?
Define streaming?
What is Hadoop Custom partitioner ?
Whats is distributed cache in hadoop?