What is the ideal replication factor in a Hadoop cluster?
Why do we use Hadoop?
If a DataNode is full, how is that identified?
What happens to a NameNode that has no data?
What happens to the NameNode when the JobTracker is down?
Does the HDFS client or the NameNode decide the input split?
What are the modes in which Apache Hadoop can run?
Is Hadoop a memory?
Where is the mapper output stored?
Have you ever built a production process in Hadoop? If so, what was the procedure when a Hadoop job failed for any reason?
Explain what Sqoop is in Hadoop.
What are Schema on Read and Schema on Write?
What is Disk Balancer in Apache Hadoop?
Explain use cases where the SequenceFile class can be a good fit.
What are the nodes in a Hadoop cluster?
How can I install the Cloudera VM on my system?