After increasing the replication level, I still see that data is under replicated. What could be wrong?
429Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
420Post New Hadoop General Questions
List of the some best tools that can be useful for data-analysis?
How ordering in hdfs is finished?
Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?
What are the important features of hadoop?
Is it possible to have hadoop job output in multiple directories? If yes, how?
What is the block size in Hadoop?
Give me examples of unstructured data?
What is a Heartbeat in Hadoop?
What are the main components of hadoop?
What is NameNode? How NameNode tackle Datanode failures in Hadoop?
What is the characteristic of streaming API that makes it flexible run MapReduce jobs in languages like Perl, Ruby, Awk etc.?
Which scala library is used for functional programming?
Can I set the number of reducers to zero?
Explain the key benefits of using storm for real time processing?
List Hadoop’s three configuration files?