Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you check whether the NameNode is working, besides using the jps command?
What do you understand by a node in Cassandra?
What is meant by RDD lazy evaluation?
What are the main components of a Hadoop Application?
What are the different types of UDFs in Java supported by Apache Pig?
What do you mean by a worker node?
Explain the advantages of HBase.
Is the NameNode also a commodity?
Define the term ‘sparse vector.’
How much does Flume cost?
Why do we use Apache Kafka?
Explain the hadoop-core configuration.
What are producer-consumer queues?
What is the Reducer used for?
Why does HDFS store data on commodity hardware despite the higher chance of failures?