Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the benefits of lazy evaluation in Spark?
What are the types of System tools?
Discuss the different tombstone markers used for deletion in HBase.
What is a MapReduce Combiner?
Can you explain how to minimize data transfers while working with Spark?
When should you not use Apache Kafka?
How can we ensure that a particular key goes to a specific reducer?
What are the components of Spark?
Explain the Tajo configuration files.
What do you understand by a worker node?
What is RDD?
Why is the data block size set to 128 MB in Hadoop?
How can a multi-hop agent be set up in Flume?
What are the three types of tombstone markers in HBase?
What is shuffling in MapReduce?