Explain the features of pseudo mode?
Should we use RAID in Hadoop or not?
how can we change Replication Factor?
Define tasktracker.
How to change from su to cloudera?
Explain why do we need hadoop?
Did you ever ran into a lop sided job that resulted in out of memory error
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
Explain the hadoop configuration files at present?
Why do we use HDFS for applications having large data sets and not when there are lot of small files?
How will you make changes to the default configuration files?
What is the purpose of dfsadmin tool?
Is it necessary to know java to learn hadoop?