Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the languages supported by apache spark and which is the most popular one?
What is difference between dataset and dataframe in spark?
What are partitions and tokens in cassandra?
Explain the Scope operators used in hbase?
How is hadoop different from spark?
What is the function of consistency cqlsh command in Cassandra?
What is NameNode and DataNode in HDFS?
What are the uses of explode hive?
Mention the common features in Pig and Hive?
If datanodes increase, then do we need to upgrade namenode?
Say when to pick “inward table” and “outside table” in hive?
Which command is available to show the current HBase user?
What is Partition table in Hive?
What is difference between flume and sqoop?
What are the 2 modes used to run pig scripts?