What is a partitioner and how the user can control which key will go to which reducer?
No Answer is Posted For this Question
Be the First to Post Answer
Does Partitioner run in its own JVM or shares with another process?
How can we control particular key should go in a specific reducer?
What is shuffling and sorting in mapreduce?
List out Hadoop's three configuration files?
How to compress mapper output in Hadoop?
What is Output Format in MapReduce?
Difference between mapreduce and spark
When should you use a reducer?
what is NameNode in Hadoop?
How to overwrite an existing output file during execution of mapreduce jobs?
Explain what does the conf.setmapper class do?
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)