Why do we use HDFS for applications with large data sets and not when there are a lot of small files?
What happens to a NameNode that has no data?
What are the functions of the JobTracker?
What is an output format in Hadoop?
What is high availability in Hadoop?
Is it necessary to write jobs for Hadoop in the Java language?
Why do we use Hadoop?
How does a client application interact with the NameNode?
How is a Mapper instantiated in a running job?
How is security achieved in Apache Hadoop?
What is HBase in Hadoop?
What are the core components of Apache Hadoop?
What is HDFS (Hadoop Distributed File System)?