Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does one create RDDs in Spark?
Explain pipe() operation in Apache Spark?
What are the advantages of DataFrame?
How can data transfer be minimized when working with Apache Spark?
Are multiline comments supported in Hive?
How can it help for avoiding costly modeling?
What is keyspace in Cassandra?
Whether the output of mapper or output of partitioner written on local disk?
Can hadoop replace relational database?
Why is Apache Spark faster than Apache Hadoop?
Does Hadoop requires RAID?
Is it possible to use the same metastore by multiple users, in case of the embedded hive?
How is reporting controlled in hadoop?
What is Fault Tolerance in HDFS?
List the languages supported by Apache Spark?