After the Map phase finishes, the Hadoop framework does 'Partitioning, Shuffle and sort'. Explain what happens in this phase?
No Answer is Posted For this Question
Be the First to Post Answer
What are the features of Fully-Distributed mode?
What is a commodity hardware? Does commodity hardware include RAM?
What is partitioning?
What is meant by streaming access?
What is the use of context object?
How is the splitting of file invoked in Hadoop ?
How many datanodes can run on a single Hadoop cluster?
Mention how many inputsplits is made by a hadoop framework?
What are the side effects of not running a secondary name node?
What does secondary name-node means?
Which database is used in hadoop?
What does the high availability of a name-node means? How is it accomplished?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)