Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Hive ?
What are the similarities and differences between Apache Flume and Apache Kafka?
What happens when two users try to access to the same file in HDFS?
What does job conf class do?
What is shuffle spill in spark?
What are impala architecture components?
Give the sqoop command to see the content of the job named myjob?
What is the purpose of DataNode block scanner?
Can you explain sequence file in hadoop?
What are the steps involved in MapReduce framework?
What is the function of NodeManager?
What is the difference between TextinputFormat and KeyValueTextInputFormat class?
What are nodes and ephemeral nodes?
What is the use of binstorage?
What are the main methods of data transferring in hadoop sqoop?