MapReduce jobs take too long. What can be done to improve the performance of the cluster?
What is a heartbeat in HDFS? Explain.
Why do we need MapReduce during Pig programming?
When should you use a reducer?
What is the role of RecordReader in Hadoop MapReduce?
What are the various configuration parameters required to run a MapReduce job?
Write a short note on the disadvantages of MapReduce.
What does conf.setMapperClass do?
What is shuffling in MapReduce?
If reducers do not start before all mappers finish, why does the progress of a MapReduce job show something like map(50%) reduce(10%)? Why is a reducer progress percentage displayed when the mappers have not finished yet?
Is sorting done on the mapper node or the reducer node in MapReduce?
What is LazyOutputFormat in MapReduce?
What are the identity mapper and the identity reducer?