Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does HDFS achieve high throughput?
Are there any special requirements for the NameNode?
How is security achieved in Hadoop?
Is Hadoop the future?
What is the use of flatMap in Spark?
What are the types of HBase operations?
While installing, why does Apache have three config files: srm.conf, access.conf, and httpd.conf?
How do I load a big CSV file into a partitioned table?
Is Hadoop mandatory for Spark?
What is the main difference between Kafka and Flume?
How can we see all the clusters that are available in Ambari?
Why is Apache Spark so fast?
How does Pig differ from MapReduce?
What is data locality in Hadoop?
What optimizations can a developer use during joins?