Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the key features of hdfs?
Can you explain heartbeat in hdfs?
What is Fault Tolerance in HDFS?
Explain the action count() in Spark RDD?
Can you explain smb join in hive?
Explain the difference between gen1 and gen2 hadoop with regards to the namenode?
How to add column in apache tajo?
Why do we need Pig?
Mention what are the most common input formats defined in hadoop?
What do you know by storage and compute node?
What do you know about keyvaluetextinputformat?
Which query language is used in Cassandra database?
What is gossip protocol in Cassandra?
What is rack-aware replica placement policy?
Where is apache spark used?