What are the key differences between Pig vs MapReduce?
No Answer is Posted For this Question
Be the First to Post Answer
Map reduce jobs are failing on a cluster that was just restarted. They worked before restart. What could be wrong?
What do you understand by mapreduce?
Explain JobConf in MapReduce.
In MapReduce, ideally how many mappers should be configured on a slave?
What is the Reducer used for?
Explain what is “map” and what is "reducer" in hadoop?
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
What is shuffling and sorting in Hadoop MapReduce?
What happens when the node running the map task fails before the map output has been sent to the reducer?
Can we set the number of reducers to zero in MapReduce?
How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?
How many times combiner is called on a mapper node in Hadoop?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)