What is the function of the JobTracker in Hadoop? How many JobTracker instances run on a Hadoop cluster?
What are the functions of NameNode?
Explain the current Hadoop configuration files.
What is the typical size of an HDFS block?
Explain the features of standalone (local) mode.
What is a Partitioner in Hadoop? Where does it run: on the mapper or the reducer?
What is Apache Hadoop YARN?
What is HDFS? How is it different from traditional file systems?
What is structured data?
Why do we use HDFS for applications with large data sets, but not when there are a lot of small files?
What is the difference between Apache Hadoop and an RDBMS?
What are the two main components of ResourceManager?
Where is the mapper's intermediate key-value output stored?