Hadoop Interview Questions
Questions Views

Explain what shuffling is in MapReduce.

459

Explain what the distributed cache is in the MapReduce framework.

468

Mention the main configuration parameters that a user needs to specify to run a MapReduce job.

571

Explain the function of the MapReduce partitioner.

477

Explain what a heartbeat is in HDFS.

29

Explain the difference between an input split and an HDFS block.

25

Explain how indexing is done in HDFS.

25

Mention the best way to copy files between HDFS clusters.

65

Mention the difference between HDFS and NAS.

61

What is the difference between an input split and an HDFS block?

56

Mention the data storage component used by Hadoop.

307

Mention what the text input format does.

303

Mention which daemons run on the master node and on the slave nodes.

357

Explain what the NameNode is in Hadoop.

315

Explain what a sequence file is in Hadoop.

343


Un-Answered Questions { Hadoop }

What is the bag?

379


Compare Hadoop and RDBMS.

305


Explain the features of Apache Spark that make it superior to Apache MapReduce.

456


Compare Hadoop and Spark.

215


Define TaskTracker.

492


What does /etc/init.d do?

534


Name several advantages of Apache Ambari.

53


What are the main classes of Data Transfer API?

5


Why are Spark transformations lazy?

246


Explain HCatInputFormat.

5


Compare Spark and Hadoop MapReduce.

263


State the syntax of the command that is used to drop a partition.

5


What do you mean by a task instance?

518


What is spark.executor.memory in a Spark application?

236


What is contextual routing in Flume?

776