What platform and Java version are required to run Hadoop?
What are the problems with Hadoop 1.0?
Given a list of followers, one pair per row, in the format: (123, 345), (234, 678), (345, 123), … where column one is the ID of the follower and column two is the ID of the followee, find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
How is a task scheduled by the JobTracker?
How would you tackle counting words in several text documents?
What are the basics of Hadoop?
What is the default replication factor?
How do you use the Apache ZooKeeper command-line interface?
What is the default block size in HDFS?
What is HDFS (Hadoop Distributed File System)?
What is the use of SSH in Hadoop?
What is the meaning of the term "non-DFS used" in Hadoop web-console?
Which are the three main hdfs-site.xml properties?
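
For the mutual-following question above, one possible approach is sketched below in Python. It is only a local simulation, not a real Hadoop job: the names `map_edge`, `reduce_pair`, and `mutual_pairs` are hypothetical, and the in-memory dictionary stands in for the shuffle phase, which a real cluster would run on disk across nodes so that the full list never has to fit in memory. The idea is to key every edge by its unordered pair of IDs, so that both directions of a relationship arrive at the same reducer.

```python
from collections import defaultdict


def map_edge(follower, followee):
    # Key each edge by the unordered pair so both directions share one key.
    key = (min(follower, followee), max(follower, followee))
    yield key, (follower, followee)


def reduce_pair(key, edges):
    # The pair is mutual only if both distinct directions were seen.
    if len(set(edges)) == 2:
        yield key


def mutual_pairs(edges):
    # Stand-in for the shuffle phase: group mapper output by key.
    groups = defaultdict(list)
    for follower, followee in edges:
        for key, value in map_edge(follower, followee):
            groups[key].append(value)
    result = []
    for key in sorted(groups):
        result.extend(reduce_pair(key, groups[key]))
    return result


sample = [(123, 345), (234, 678), (345, 123)]
print(mutual_pairs(sample))
```

Because each reducer only ever sees the edges for a single pair of IDs, the per-key work stays tiny regardless of how large the overall follower list is.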