Apache Hadoop Interview Questions
Questions Answers Views Company eMail

Why do we use HDFS for applications having large data sets and not when there are lot of small files?

1 2132

What are the functions of NameNode?

1 1413

How to configure hadoop to reuse JVM for mappers?

793

How to resolve IOException: Cannot create directory

679

How to change replication factor of files already stored in HDFS?

713

Which one is default InputFormat in Hadoop ?

1 1670

shouldn't DFS be able to handle large volumes of data already?

776

what is a datanode?

641

How does NameNode tackle DataNode failures?

879

What is InputSplit and RecordReader?

647

What is the purpose of dfsadmin tool?

917

How can you connect an application

724

how is a file of the size 1 GB uncompressed

591

Is map like a pointer?

679

What is the default replication factor?

695


Post New Apache Hadoop Questions

Un-Answered Questions { Apache Hadoop }

What is DistributedCache and its purpose?

632


Explain a simple Map/Reduce problem.

453


Explain the core components of hadoop?

398


What are the port numbers of namenode, job tracker and task tracker?

453


How Mapper is instantiated in a running job?

749






What are sink processors?

645


Define fault tolerance?

403


What are the network requirements for using hadoop?

384


Is client the end user in HDFS?

726


How does a namenode handle the failure of the data nodes?

428


What is the default block size in hdfs?

736


What alternate way does HDFS provides to recover data in case a Namenode

675


Explain Erasure Coding in Apache Hadoop?

420


On what basis Namenode will decide which datanode to write on?

1022


Name the various types of lists supported by bootstrap.

408