Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Mention what is the difference between hdfs and nas?
What is scala spark?
What do you mean by taskinstance?
Mention some basic tajo shell commands?
how JobTracker schedules a task ?
List few differences between apache kafka and rabbitmq?
What are the benefits of apache kafka over the traditional technique?
What is bucketing ?
What are the all tasks we can perform for managing services using the ambari service tab?
Specify the different methods of hive?
Compare rdbms with hbase?
What is in memory processing in spark?
What are the common mistakes developers make when running Spark applications?
What is log compaction?
Explain HCatLoader APIs?