Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)

When creating an RDD, what goes on internally?
How do you delete a directory and its files recursively from HDFS?
Can Spark be used without Hadoop?
How do I know how many Impala nodes are in my cluster?
Explain ZooKeeper queues.
What are the features of Apache Mahout?
What are the common types of NoSQL databases?
What do you understand by partitions in Spark?
What is the throughput?
What happens if the number of reducers is 0 in MapReduce?
What is the difference between coalesce and repartition in Spark?
What are the differences between MongoDB and Cassandra?
What is a map-side join?
What are the different logging levels in Cassandra?
Explain the core components of a distributed Spark application.