Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
765What mechanism does hadoop framework provides to synchronize changes made in distribution cache during runtime of the application?
714Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
474
Give examples of the SerDe classes whihc hive uses to Serializa and Deserilize data?
What is indexing and why do we need it?
Explain the repartition() operation in Spark?
How can you achieve high availability in Apache Spark?
what is (HS2) HiveServer2?
How to create table in hbase?
What is Rack Awareness? What is its need in Hadoop?
It can be possible that a Job has 0 reducers?
What is the use of expand cqlsh command in Cassandra?
How do I set up flume agent?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
How can we create a hadoop cluster from scratch?
Which file systems does Spark support?
What combiners are and when you should use a combiner in a mapreduce job?
Mention what is the data storage component used by hadoop?