how would you modify that solution to only count the number of unique words in all the documents?
No Answer is Posted For this Question
Be the First to Post Answer
What are the functionalities of jobtracker?
what is difference between int and intwritable?
How is the distance between two nodes defined in Hadoop?
How does master slave architecture in the hadoop?
What infrastructure do we need to process 100 TB data using Hadoop?
Explain the use of tasktracker in the hadoop cluster?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
How can one increase replication factor to a desired value in Hadoop?
What is Partioner in hadoop? Where does it run
What are the most commonly defined input formats in Hadoop?
What are sink processors?