Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is shark?
HDFS is used for applications with large data sets, not why Many small files?
Why lazy evaluation is good in spark?
Is there a dual table?
What is hadoop sqoop?
Can you define inputsplit in hadoop?
How to specify more than one directory as input in the Hadoop MapReduce Program?
Can you define what is Event Serializer in Flume?
Is secondary namenode a substitute to the namenode?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
Why Apache Spark?
What is Disk Balancer in Hadoop?
What are accumulators in spark?
Does sqoop use mapreduce?
What are the three components of Cassandra write?