Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is structured data?
Describe Network Topology Strategy?
What is spark context spark session?
What does apache mahout do?
In MapReduce, ideally how many mappers should be configured on a slave?
Can you explain benefits of spark over mapreduce?
What are the main features of hdfssite.xml?
What is the process of creating an Ambari client?
How to save RDD?
What is Cassandra-CQL collection?
Explain Tombstone in Cassandra?
What is difference between reducer and combiner?
What is Fault Tolerance?
Is spark based on hadoop?
Why do we need Pig?