Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do I try impala out?
What are the disservices of utilizing Apache Spark over Hadoop MapReduce?
How do I get better performance with spark?
What is the difference between spark ml and spark mllib?
What are the benefits of block transfer?
What is the point of apache spark?
Explain cap theorem?
Where the mapper's intermediate data will be stored?
Which directory does hadoop install to?
What is LazyOutputFormat in MapReduce?
How many types of ambari repositories are available?
Is it possible to share data files between different components?
Who created spark?
How will you backup an HBase cluster?
how Cassandra writes data?