Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Mention the difference between hbase and relational database?
What do you mean by column family in Cassandra?
What is the difference between hadoop and spark?
Can you explain smb join in hive?
What is in memory processing in spark?
Explian the Limitations of HBase?
What are barriers?
What are the features of RDD, that makes RDD an important abstraction of Spark?
What are all stats classes in the org.apache.pig.tools.pigstats package?
What are the file formats that Hive supports and can use be used for storage?
How Hive distributes the rows into buckets?
Why do we need hdfs?
Define the roles of the file system in any framework?
What is an rdd?
What is hbase fsck?