Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which language is best for spark?
Is hive similar to sql?
What is catalyst framework in spark?
What are the independent extensions that contributed to the ambari codebase?
What are snapshots and how do you create one in cassandra?
What other technologies have you used in hadoop sta ck?
what is Zookeeper in Kafka? Can we use Kafka without Zookeeper?
Clarify the NoSQL Database?
How namenode handles data node failures?
What is partitioning key?
Name the operating system(s) which are supported for production hadoop deployment?
What is the use of cloudera?
Which spark library allows reliable file sharing at memory speed across different cluster frameworks?
What is the difference between rdd and dataframe?
What do you mean by shuffling and sorting in MapReduce?