What are the network requirements for hadoop?
What does hadoop-metrics.properties file do?
What are some typical functions of job tracker in hadoop?
What are the important modes of hadoop?
What mode(s) can hadoop code be run in?
Mention what is the difference between an rdbms and hadoop?
What are configuration files in Hadoop?
What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?
What is NameNode? How NameNode tackle Datanode failures in Hadoop?
Is Namenode machine same as DataNode machine as in terms of hardware?
Define data cleansing?
Explain how can you check whether namenode is working beside using the jps command?
What is Identity reducer?
Is nosql follow relational db model?
What is a record reader?