What is log compaction?
How can you see the list of stored jobs in the Sqoop metastore?
What is Output Format in MapReduce?
What is a NameNode? How many NameNode instances run on a Hadoop cluster?
What are the differences between Pig and SQL?
State some command-line options.
What is the difference between Spark and Hive?
What are column families in HBase?
What are the disadvantages of using Spark?
Explain the core components of a distributed Spark application.
Give any two features of Flume.
What is the difference between a Dataset and a DataFrame in Spark?
What are the differences between Hadoop 1 and Hadoop 2?
How can you use the Producer API in code?
What are shuffling and sorting in MapReduce?