Hadoop Interview Questions and Answers for Freshers and Experienced Candidates, Asked in Various Company Job Interviews

Hadoop Interview Questions

What happens to the JobTracker when the NameNode is down?

How can I restart the NameNode?

What do masters consist of?

What if a NameNode has no data?

Which are the three main hdfs-site.xml properties?

What does formatting the DFS mean?

In Cloudera there is already a cluster; if I want to form a cluster on Ubuntu, can I do so?

Does this lead to security issues?

How do I change from su to cloudera?

Why is a password needed for ssh localhost?

What happens to the NameNode when the JobTracker is down?

What does the mapred.job.tracker property do?

How do you exit insert mode?

What is the full form of fsck?

Why do we need passwordless SSH in a fully distributed environment?

Unanswered Questions (Hadoop)

How to set mappers and reducers for MapReduce jobs?

Explain the basic difference between Cassandra and HBase.

How can you get exactly-once messaging from Kafka during data production?

What are Hive's default read and write classes?

What is structured data?

Why can we not create the directory /user/dataflair/inpdata001 when the NameNode is in safe mode?

Give examples of unstructured data.

Is it possible to create multiple tables in Hive for the same data?

Explain the role of the Streams API.

What features of RDD make it an important abstraction in Spark?

Explain the various transformations on Apache Spark RDDs, such as distinct(), union(), intersection(), and subtract().

Describe partitions in Hive.

Name any two features of Flume.

How can one set a space quota on a Hadoop (HDFS) directory?

Why is HDFS used for applications with large data sets rather than many small files?
