Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the different tools used for monitoring in Ambari?
What are the Writable and WritableComparable interfaces?
What do you understand by MapReduce?
What do sorting and shuffling do?
What is a block in HDFS, and why is the block size 64 MB?
What is contextual routing in Flume?
What do masters consist of?
Is Spark based on Hadoop?
On what basis does the NameNode distribute blocks across the DataNodes in HDFS?
What is the number of executors in Spark?
What do the shell commands “capture” and “consistency” determine?
Can you list down the limitations of using Apache Spark?
What language is Apache Spark written in?
How can you create a topic in Kafka?
Explain the CAP theorem.