Why do we use HDFS for applications having large data sets and not when there are lot of small files?
1 2768Post New Apache Hadoop Questions
What is IdentityMapper?
What do you mean by taskinstance?
What is Schema on Read and Schema on Write?
What is Partioner in hadoop? Where does it run
What is the meaning of the term "non-DFS used" in Hadoop web-console?
What is version-id mismatch error in hadoop?
What is DistributedCache and its purpose?
Define a namenode?
What happens to job tracker when namenode is down?
How to restart Namenode?
How a task is scheduled by a jobtracker?
Explain what if rack 2 and datanode fails?
What are the main components of a Hadoop Application?
What is high availability in hadoop?
What are combiners and its purpose?