What is erasure coding in Hadoop?
Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?
What is the default partitioner in Hadoop?
What is the replication factor?
When and how do you create a Hadoop archive?
What does the JobConf class do?
What is distributed copy (distcp)?
Do the JobTracker and the TaskTrackers run on separate machines?
How can we check whether the NameNode is working?
How do you add a node to, or remove a node from, an existing cluster?
What is JobTracker?
What are the important modes of Hadoop?
Is it possible to search for files using wildcards?