Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Does kafka use hdfs?
How to identify that given operation is transformation/action in your program?
What is the sequencefileinputformat in hadoop?
What is Distributed Cache in Hadoop?
What is pregel api?
Differentiate between GROUP and COGROUP operators?
What is the Use of Sqoop?
Are multiline comments supported in Hive?
How is indexing done in HDFS?
Can you explain how you can use Apache Spark along with Hadoop?
Name a few import control commands. How can Sqoop handle large objects?
Is kafka a amqp?
Mention what daemons run on a master node and slave nodes?
Is Apache Spark a good fit for Reinforcement learning?
State syntax of the command that is used to drop a partition?