Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which command is used to retrieve the status of the daemons running in the Hadoop cluster?
What file formats does Hive support for storage?
What is a DAG (directed acyclic graph)?
Explain the Reducer's reduce phase.
Why is Kafka a significant technology to use?
What is the difference between Pig Latin and HiveQL?
What kind of hardware is best for Hadoop?
On what basis does the NameNode distribute blocks across the DataNodes in HDFS?
What is the physical plan in the Pig architecture?
What is the write-ahead log (journaling) in Spark?
What is a combiner, and when should you use one in a MapReduce job?
What is the role of “ambari-qa” user?
How is an RDD fault tolerant?
What is the purpose of sqoop-merge?
What is the row key?