Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between apache mahout and spark mllib ?
Describe HDFS Federation?
What are the features of apache cassandra?
Explain HCatLoader APIs?
What does secondary name-node means?
How to write 'foreach' statement for tuple datatype in pig scripts?
How is impala metadata managed?
When to use Cassandra?
How does cassandra perform read operation? Explain
What are the side data distribution techniques?
Does the HDFS go wrong? If so, how?
what is the traditional method of message trfer?
What is the problem in having lots of small files in hdfs?
Why spark is faster than hive?
If DataNode increases, then do we need to upgrade NameNode?