Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Compare Hadoop and Spark?
List of some best tools that can be useful for data-analysis?
What is the difference between HDFS block and input split?
Explain Creating an Index?
What do you understand by the term snitch in cassandra? Give some example.
What is the difference between Gen1 and Gen2 Hadoop with regards to the Namenode?
What is the use of binstorage?
What is the maximum number of rows in a table?
How does apache flume work?
How a task is scheduled by a jobtracker?
What MapReduce framework consists of?
How does an hadoop application look like or their basic components?
What are Features of Hive?
Should I install spark on all nodes of yarn cluster?
What is struct and explain its purpose?