what is the typical block size of an HDFS block?
Which are the three main hdfs-site.xml properties?
Explain InputFormat?
What is the difference between a Hadoop and Relational Database and Nosql?
Define a job tracker?
How indexing is done in HDFS?
Why is hadoop faster?
Define streaming access?
What platform and java version are required to run hadoop?
What is Derby database?
how would you modify that solution to only count the number of unique words in all the documents?
What is HDFS Federation?
What is configuration of a typical slave node on Hadoop cluster? How many JVMs run on a slave node?