Apache Hadoop Interview Questions
Questions Answers Views Company eMail

Why do we use HDFS for applications with large data sets, and not when there are a lot of small files?

1 2341

What are the functions of NameNode?

1 1583

How to configure Hadoop to reuse the JVM for mappers?

906

How to resolve IOException: Cannot create directory

778

How to change the replication factor of files already stored in HDFS?

808

Which InputFormat is the default in Hadoop?

1 1876

Shouldn't DFS be able to handle large volumes of data already?

868

What is a DataNode?

733

How does NameNode tackle DataNode failures?

966

What is InputSplit and RecordReader?

739

What is the purpose of the dfsadmin tool?

1025

How can you connect an application

818

How is a file of the size 1 GB uncompressed

674

Is map like a pointer?

758

What is the default replication factor?

775


Un-Answered Questions { Apache Hadoop }

On which port does SSH work?

534


How can I install the Cloudera VM on my system?

747


How is the distance between two nodes defined in Hadoop?

1262


What is the rack-aware replica placement policy?

776


What is Schema on Read and Schema on Write?

734


What is speculative execution in Hadoop?

875


What are the modules that constitute the Apache Hadoop 2.0 framework?

801


Have you ever used Counters in Hadoop? Give us an example scenario.

831


What are sink processors?

729


What does the mapred.job.tracker property do?

530


How are blocks distributed among all DataNodes for a particular chunk of data?

840


How can we change the replication factor when data is on the fly?

1164


Explain a simple Map/Reduce problem.

548


What is HBase in Hadoop?

485