Why Hadoop performs replication, although it results in data redundancy?
what is next step after mapper or maptask?
Ideally what should be the block size in hadoop?
Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
What is a combiner in hadoop?
Whats the default port that jobtrackers listens ?
What are the important features of hadoop?
Explain InputSplit in Hadoop?
Why is Data Block size set to 128 MB in Hadoop?
What is a record reader?
What happens when two clients try to access the same file in the hdfs?
Why cloudera is used?
How can you native libraries be included in yarn jobs?
What do you understand by unit and ()in scala?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?