Big Data Interview Questions and Answers for Freshers and Experienced Candidates

Apache Hadoop (387)
MapReduce (351)
Apache Hive (334)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (188)

Big Data General (101)
Big Data AllOther (3)

Unanswered Questions { Big Data }

What is the difference between the NameNode and a DataNode in Hadoop?

495

Ideally, what should the replication factor be in Hadoop?

426

What is rack awareness in Hadoop?

487

Why might the DataNode service fail to run when Hadoop services are started?

439

What happens when the NameNode goes down during a file read operation in Hadoop?

593

Why are slave nodes limited to 4000 in Hadoop version 1?

487

What is a heartbeat in Hadoop?

426

Explain the small files problem in Hadoop.

421

How can a directory be created when the NameNode is in safe mode?

547

Why can we not create the directory /user/dataflair/inpdata001 when the NameNode is in safe mode?

449

Ideally, what should the block size be in Hadoop?

461

What happens when the NameNode enters safe mode in Hadoop?

431

What is shuffling in MapReduce?

648

Does the MapReduce programming model provide a way for reducers to communicate with each other? In a MapReduce job, can one reducer communicate with another reducer?

679

How would you calculate the number of unique visitors for each hour by mining a huge Apache log? You may post-process the output of the MapReduce job.

826
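
One possible way to approach the unique-visitors question above, sketched in plain Python that mimics the map, shuffle, and reduce phases instead of calling any real Hadoop API. It assumes the log is in Apache Common Log Format and that a visitor can be identified by client IP address, which is a simplification. The names LOG_PATTERN, map_line, shuffle, reduce_hour, and unique_visitors_per_hour are hypothetical and chosen only for illustration.

```python
import re
from collections import defaultdict

# Captures the client address and the "day/month/year:hour" prefix of a
# Common Log Format timestamp, e.g. [10/Oct/2000:13:55:36 -0700].
LOG_PATTERN = re.compile(
    r'^(\S+) \S+ \S+ \[(\d{2}/\w{3}/\d{4}:\d{2}):\d{2}:\d{2} [+-]\d{4}\]'
)


def map_line(line):
    """Map step: emit (hour, visitor) for one log line; skip malformed lines."""
    match = LOG_PATTERN.match(line)
    if match:
        visitor, hour = match.group(1), match.group(2)
        yield hour, visitor


def shuffle(pairs):
    """Stand-in for the framework's shuffle phase: group values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_hour(hour, visitors):
    """Reduce step: count the distinct visitors seen within one hour."""
    return hour, len(set(visitors))


def unique_visitors_per_hour(lines):
    """Run the three phases in-process over an iterable of log lines."""
    pairs = (pair for line in lines for pair in map_line(line))
    return dict(reduce_hour(h, v) for h, v in shuffle(pairs).items())


if __name__ == "__main__":
    sample = [
        '10.0.0.1 - - [10/Oct/2000:13:55:36 -0700] "GET / HTTP/1.0" 200 2326',
        '10.0.0.2 - - [10/Oct/2000:13:59:59 -0700] "GET / HTTP/1.0" 200 2326',
        '10.0.0.1 - - [10/Oct/2000:14:00:00 -0700] "GET / HTTP/1.0" 200 2326',
    ]
    for hour, count in sorted(unique_visitors_per_hour(sample).items()):
        print(hour, count)
```

Keying on the hour means each reducer works independently, so no reducer needs to talk to another. The in-memory set in reduce_hour is the part that may not scale on a truly huge log; emitting a composite (hour, visitor) key in a first job to deduplicate and counting in a second job, or in post-processing, is one commonly suggested alternative.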