Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the pig.properties file?
What is an SSTable? How is it different from relational tables?
Explain what Hive is.
What is a session in Cassandra?
What are pair RDDs?
What is a MapFile?
What is the Python stress test in Cassandra?
Explain the concept of a resilient distributed dataset (RDD).
What are the phases of the Reducer? Describe each in detail.
What is a key-value pair in Hadoop MapReduce?
What does the rack awareness algorithm mean, and why is it used in Hadoop?
What are the main features of hdfs-site.xml?
Ambari version 2.6.2 added the following features:
What is in-memory computation in Spark?
Why does HDFS perform replication, even though it results in data redundancy?