What are the core components of Apache Hadoop?
Explain how do ‘map’ and ‘reduce’ works?
What happens to a NameNode that has no data?
What is a secondary namenode?
What are the functions of NameNode?
Why are the number of splits equal to the number of maps?
Where is the Mapper Output intermediate kay-value data stored ?
What are the functionalities of jobtracker?
What is HDFS Block size? How is it different from traditional file system block size?
What is high availability in hadoop?
What is cloudera and why it is used?
Define a metadata?
Name some companies that use Hadoop?