Why do we use HDFS for applications having large data sets and not when there are lot of small files?
1 2370Post New Apache Hadoop Questions
What is the function of NodeManager?
How can one increase replication factor to a desired value in Hadoop?
Virtual Box & Ubuntu Installation?
Is hadoop required for data science?
What is inputformat in hadoop?
Explain why the name ‘hadoop’?
How to change Replication Factor For below cases ?
How would you tackle counting words in several text documents?
What is Partioner in hadoop? Where does it run
How to configure hadoop to reuse JVM for mappers?
What is Hadoop streaming?
Is it possible to provide multiple inputs to hadoop? If yes, explain.
Explain the basic difference between traditional rdbms and hadoop?
What Mapper does?
What are the two main parts of the hadoop framework?