How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
No Answer is Posted For this Question
Be the First to Post Answer
What is the function of mapreduce partitioner?
Is it possible to split 100 lines of input as a single split in MapReduce?
What are ‘maps’ and ‘reduces’?
Explain about the partitioning, shuffle and sort phase
What happens if the quantity of the reducer is 0 in mapreduce?
What is a partitioner and how the user can control which key will go to which reducer?
what is JobTracker in Hadoop? What are the actions followed by Hadoop?
How does Hadoop Classpath plays a vital role in stopping or starting in Hadoop daemons?
What is optimal size of a file for distributed cache?
What is the inputsplit in map reduce software?
What are combiners? When should I use a combiner in my MapReduce Job?
Differentiate Reducer and Combiner in Hadoop MapReduce?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)