Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the Use of Sqoop?
Is there any point of learning mapreduce, then?
What is contextual routing in flume?
When do you have to avoid secondary indexes?
Define Nodetool Utility?
Explain the term memtable?
Differentiate between hive and hbase?
What is spooldir flume?
Explain the processing speed difference between Hadoop and Apache Spark?
Why HDFS performs replication, although it results in data redundancy?
What bit version that ambari needs and also list out the operating systems that are compatible?
What is the relationship between Job and Task in Hadoop?
What is the use of "order by" in Hive?
What does the high availability of a name-node means?
Do we need to install scala for spark?