Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the role of the offset?
What is the difference between Cassandra and MongoDB?
Can you explain the common input formats in Hadoop?
What are the similarities and differences between Apache Flume and Apache Kafka?
How can you remove the elements with a key present in any other RDD?
Explain the data flow in Flume.
What is the Secondary NameNode?
What is the difference between Scala and Spark?
What happens if the number of reducers is 0 in Hadoop?
What are the various functions of Spark Core?
Explain the TextLoader function.
What is hotspotting in HBase?
What is an RDD in Apache Spark? How are RDDs computed, and what are the various ways to create one?
Explain the reduceByKey() operation in Spark.
How can you improve the throughput of a remote consumer?