Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do I set up a Flume agent?
How many instances of JobTracker can run on a Hadoop cluster?
What additional benefits does YARN bring to Hadoop?
Is the NameNode also commodity hardware?
What is the difference between an HDFS block and an input split?
What is the org.apache.jute package?
What is Geo-Replication in Kafka?
Does Spark store data?
What is the difference between Sqoop and Hive?
What is the role of the NameNode?
What is the difference between an RDBMS and Hadoop?
What is the role of the “ambari-qa” user?
Why do we need Hadoop for big data analytics?
What are the stable versions of Hadoop?
Can you explain what a record reader is?