Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the applications of Apache ZooKeeper?
Explain the different logging levels in cassandra.
What is the biggest shortcoming of Spark?
What is Apache Spark and what are the benefits of Spark over MapReduce?
Define a daemon?
In Map Reduce why map write output to Local Disk instead of HDFS?
What is sink processors?
What is spark deploy mode?
Explain Usage of Hive?
Explain the filter transformation?
What exactly is spark?
Can you join multiple fields in apache pig scripts?
Which all languages Apache Spark supports?
Is Hive useful when making data warehouse applications?
What is a scarce system resource?