How does the JobTracker schedule an assignment?
Can you define InputSplit in Hadoop?
Explain how the JobTracker schedules a task.
What are the features of Fully-Distributed mode?
What is the role of the Secondary NameNode?
Can you explain what a combiner is?
How would you check whether your NameNode is working or not?
How is security achieved in Hadoop?
What is a UDF?
What is the NameNode?
What are the side effects of not running a Secondary NameNode?
Can you explain speculative execution?
What type of data should we put in the distributed cache? When should we put data in the distributed cache? How much data should we put in it?
What types of data can Hadoop handle?
What are the port numbers of the TaskTracker?