Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)

Hadoop Interview Questions

Questions Answers Views Company eMail

What do you understand by standalone (or local) mode?

473

What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?

478

What is pseudo-distributed mode?

648

When we send a data to a node, do we allow settling in time, before sending another data to that node?

528

Can we write map reduce program in other than java programming language. How?

525

Are job tracker and task trackers present in separate machines?

505

What is namenode?

520

what if job tracker machine is down?

514

What are the four basic parameters of a mapper?

619

In which location name node sores its metadata and why?

706

What are input format, input split & record reader and what they do?

671

How namenode handles data node failures?

569

What is the non dfs used?

594

How can one write custom record reader?

529

Whats the default port that jobtrackers listens ?

515

Un-Answered Questions { Hadoop }

Which are the methods to create rdd in spark?

301

What are the key features of HDFS?

Why comparison of types is important for MapReduce?

1008

What are the relation operations in Pig? Explain any two with examples?

759

What are the different methods to set up local repositories?

Name different types of primary keys in Cassandra?

Can you explain recommendation engine?

Why there is need of pig language?

621

What is the definition of Hive?

776

How to set property in apache tajo?

Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?

Data Engineer Given a list of followers in the format:123, 345234, 678345, 123â€¦Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

786

What are the debugging tools used for Apache Pig scripts?

823

What is Distributed Cache in Hadoop?

502

How Hive distributes the rows into buckets?

850

For More Un-Answered { Hadoop } Questions Click Here