Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is a spark standalone cluster?
What is HBase?
What is identity mapper and chain mapper?
Can we say a COGROUP is a group of more than 1 data set?
What is streaming in Hadoop?
State the difference between Spark SQL and Hql
Explain the process of spilling in Hadoop MapReduce?
What are the important differences between apache and hadoop?
Explain the term paired RDD in Apache Spark?
What is document store db?
Which command is available to show the current HBase user?
What is the roadmap for apache driver version one.0?
What are the other components of ambari that are important for automation and integration?
What are the 2 types of table in hive?
What are the other components of Cassandra?