Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is secondary namenode? Is it a substitute or back up node for the namenode?
Explain the architecture of Hadoop Pig?
What is azure spark?
What is lambda architecture spark?
When NameNode enter in Safe Mode?
What do you mean by consistency in Cassandra?
What happen if number of reducer is set to 0 in Hadoop?
If there is certain data that we want to use again and again in different transformations, what should improve the performance?
What is the use of Bloom Filter in Cassandra?
How jobtracker assign tasks to the tasktracker?
Why Flume?
What is hdfs spark?
Name some independent extensions that contribute to the Ambari codebase?
Explain how RDDs work with Scala in Spark
What is Spark Driver?