Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What Platforms Cassandra runs on?
What does the mapred.job.tracker command do?
Define the roles of the file system in any framework?
What is 'Key value pair' in HDFS?
What is difference between map and flatmap in spark?
Explain the term paired RDD in Apache Spark?
What is Reducer in Hadoop?
What is Apache Spark?
Explain the format of an apache kafka message?
Are multiline comments supported in Hive?
Is bigger than spark driver maxresultsize?
In case of embedded Hive, can the same metastore be used by multiple users?
In which location Name Node stores its Metadata and why?
Mention what is HiveServer2 (HS2)?
Explain Creating an Index?