Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)

What are the main features of Impala?
What is a MapReduce Combiner?
What is InputFormat?
What is a Reducer in Hadoop?
Why do we use Flume?
What is Safemode in Apache Hadoop?
Which features do Pig and Hive have in common?
What does the text input format do?
Where is Apache Spark used?
What is the Avro format?
What do you understand by a worker node?
Is it possible to provide multiple inputs to Hadoop? If yes, explain.
How is RDD in Apache Spark different from Distributed Storage Management?
Can you do real-time processing with Spark SQL?
What is a secondary index in Cassandra?