Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the primary purpose of flume in the hadoop architecture?
How does broadcast join work in spark?
Can you explain record reader?
There seem to be certain management tools in Cassandra. What are they?
when do reducers play their role in a mapreduce task?
What are impala architecture components?
What is a checkpoint?
State the difference between persist() and cache() functions.
How HCatalog helps to capture processing states to enable sharing?
What is Flume?
Explain deletion in hbase?
When the reducers are are started in a mapreduce job?
How can one write custom record reader?
What is the Use of SSH in Hadoop ?
What are the actions in spark?