Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain HCatInputFormat?
What are the differences between PIG and MapReduce?
What is the use of Combiner?
What is the role of a zookeeper in a kafka cluster?
What is anti-entropy?
What happen on the namenode when a client tries to read a data file?
How we can check hadoop sqoop installed or not in a system?
Is hive an impala requirement?
Is hadoop a memory?
Explain Apache Ambari?
How is it completely different from doing machine learning in r or sas?
How can we create znodes?
Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?
What are the different commands used to startup and shutdown Hadoop daemons?
What are the side effects of not running a secondary name node?