Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do you set up a spark?
What are accumulators in Apache Spark?
What is fluming?
What is the relationship between Hadoop, HBase, Hive and Cassandra ?
What is the no. Of threads created by impala?
What happens when you submit spark job?
What is the purpose of button groups?
What is avro format?
What do you understand by Commit log in Cassandra?
What is partitioning?
Is spark faster than hadoop?
How do I download and install spark?
Can you overwrite Hadoop MapReduce configuration in Hive?
What is the problem in having lots of small files in hdfs?
What is a DStream?