Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is lambda in spark?
What does FOREACH do?
Why comparison of types is important for MapReduce?
What are the exception handling operators in Pig script?
How can you send some messages in kafka?
Mention what is the traditional method of message transfer?
How the Client communicates with HDFS?
Are results returned as they become available, or all at once when a query completes?
What is a hive on spark?
What do you understand by Executor Memory in a Spark application?
What are transformations in spark?
Explain Machine Learning library in Spark?
Mention the date data type in hive. Name the hive data type collection.
What is a bookie in bookkeeper?
When to use explode in Hive?