Define streaming?
What is a checkpoint?
In cloudera there is already a cluster, but if I want to form a cluster on ubuntu can we do it?
What does the command mapred.job.tracker do?
when hadoop enter in safe mode?
What is Partioner in hadoop? Where does it run
What is the default replication factor?
What is the default block size in Hadoop 1 and in Hadoop 2? Can it be changed?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
How to change Replication Factor For below cases ?
What is formatting of the dfs?
What Mapper does?
What other technologies have you used in hadoop sta ck?