Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between Cassandra, Pig and Hive?
What is wal and hlog in hbase?
What is CAP Theorem? What aspects does Hadoop support from this theorem?
What is a namenode? How many instances of namenode run on a hadoop cluster?
What does heartbeat in hdfs means?
What are the differences between Caching and Persistence method in Apache Spark?
What is a Record Reader in hadoop?
Explain InputSplit in Hadoop?
Does spark load all data in memory?
What is the difference between an input split and hdfs block?
What do you mean by ZNode?
Explain fullOuterJoin() operation in Apache Spark?
How to specify more than one path for storage in Hadoop?
What are the primary phases of a Reducer?
What are the types of Apache Spark transformation?