Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference betwaeen mapreduce engine and hdfs cluster?
List the files associated with metadata in hdfs?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
What is the replica placement Strategy in Cassandra ?
Can you explain heartbeat in hdfs?
What does the "USE" command in hive do?
What are “Seed Nodes” in Cassandra?
What is JPS? Why is it used in Hadoop?
Input Split & Record Reader and what they do?
What is lambda in spark?
Can Ambari manage multiple clusters?
Is hadoop required for data science?
What is identity mapper and chain mapper?
Can you explain the term, Cassandra?
Why are Replications critical in Kafka?