Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Does spark need yarn?
Why do we use Hadoop?
Define "Action" in Spark
What is broadcast variable?
What is the difference between apache mahout and spark mllib ?
Which one is better hadoop or spark?
What is the need for Spark DAG?
Why does the picture of Spark come into existence?
Explain the features of pseudo mode?
What is the difference between Cassandra and Hadoop ?
What is the difference between Spark Transform in DStream and map ?
What are the various data types in presto?
Describe JMX?
How can we launch Spark application on YARN?
How hadoop mapreduce works?