how would you modify that solution to only count the number of unique words in all the documents?
No Answer is Posted For this Question
Be the First to Post Answer
How the HDFS Blocks are replicated?
What do masters consist of?
What is partioner in hadoop? Where does it run,mapper or reducer?
How can one increase replication factor to a desired value in Hadoop?
Explain the use of tasktracker in the hadoop cluster?
What does /etc /init.d do?
Why do we use HDFS for applications having large data sets and not when there are lot of small files?
Is map like a pointer?
What is a Task instance in Hadoop? Where does it run?1
What is the full form of fsck?
How to change Replication Factor For below cases ?
What is configuration of a typical slave node on Hadoop cluster? How many JVMs run on a slave node?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)