Why do we use HDFS for applications having large data sets and not when there are lot of small files?
What does job conf class do?
In which location Name Node stores its Metadata and why?
What is zookeeper in hadoop?
Command to format the NameNode?
Where is the Mapper Output intermediate kay-value data stored ?
What are different types of filesystem?
How is HDFS fault tolerant?
Explain the features of pseudo mode?
Explain the core components of hadoop?
What do you know about nlineoutputformat?
What is the purpose of RawComparator interface?
On what basis Namenode will decide which datanode to write on?