Hadoop Interview Questions
Questions Answers Views Company eMail

What are the functions of NameNode?

1 1607

How to configure hadoop to reuse JVM for mappers?

920

mapper or reducer?

737

How to resolve IOException: Cannot create directory

792

Does Pig support multi-line commands?

678

How to change replication factor of files already stored in HDFS?

826

Which one is default InputFormat in Hadoop ?

1 1906

shouldn't DFS be able to handle large volumes of data already?

878

What is Apache Pig?

660

what is a datanode?

742

How does NameNode tackle DataNode failures?

973

What is InputSplit and RecordReader?

746

What is the purpose of dfsadmin tool?

1041

How can you connect an application

836

What are combiners? When should I use a combiner in my MapReduce Job?

757


Un-Answered Questions { Hadoop }

Explain how do ‘map’ and ‘reduce’ work?

445


What is a rack awareness algorithm and why is it used in hadoop?

25


What is Data Locality in Hadoop?

482


What are the different clustering in mahout?

39


What are the prime features of apache zookeeper?

1


Say when to pick “inward table” and “outside table” in hive?

542


What is spark reducebykey?

231


What is network topology strategy?

65


What is the key- value pair in MapReduce?

553


when hadoop enter in safe mode?

559


What do you mean by column family in Cassandra?

76


Define a namenode?

492


What is streaming access?

357


Define sparksession in apache spark? Why is it needed?

230


Why should we use ‘orderby’ keyword in pig scripts?

368