What are the benefits yarn brings in to hadoop?
Why is checkpointing important in hadoop?
How will you write a custom partitioner for a Hadoop job?
What are the important features of hadoop?
Give me the examples of Columnar database ?
Explain InputSplit in Hadoop?
Can you explain sequence file in hadoop?
What are some typical functions of Job Tracker?
Tell me some major benefits of Hadoop?
If we want to copy 10 blocks from one machine to another, but another machine can copy only 8.5 blocks, can the blocks be broken at the time of replication?
What are the core components of Hadoop?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
What do you know about the speculative execution?
How can one check whether NameNode is working or not?
What are input format, input split & record reader and what they do?