How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
What is a partitioner in Hadoop? Where does it run, on the mapper or the reducer?
Do I need to know Scala to learn Spark?
What is Project Tungsten in Spark?
What is the process to perform an incremental data load in Sqoop?
Is it legal to set the number of reducer tasks to zero? Where will the output be stored in this case?
What are the common mistakes developers make when using Apache Spark?
What does Avro offer?
What is the function of HMaster?
What are “Seed Nodes” in Cassandra?
Explain the difference between the MapReduce engine and the HDFS cluster.
What is Mapper in Hadoop MapReduce?
Differentiate between Hive and HBase.
What is the difference between an external table and an internal table in Hive?
What is the difference between Cassandra and Hadoop?