Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
CONCAT function in Hive with Example?
Who are ‘Data Scientists’?
What is the roadmap for apache mahout version 1.0?
When is it not recommended to use MapReduce paradigm for large
What is the ZooKeeper ensemble?
What is the use of context object?
What is Small File Problem in Hadoop? How can it be resolved?
What are impala architecture components?
How do we write our own custom serde?
What is a dataset? What are its advantages over dataframe and rdd?
When you should use Hbase?
What does producer api in kafka?
What stored in HDFS?
Explain the LOAD keyword in Pig script?
How do I get better performance with spark?