Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is kafka a message queue?
What are the benefits of Spark lazy evaluation?
Which is the best spark certification?
Mention what is rack awareness?
What are the various components in kafka.
Define the Use of Pig?
Explain the uses of PIG?
How to write a custom partitioner for a Hadoop MapReduce job?
The partition of hive table has been modified to point to a new directory location. Do I have to move the data to the new location or the data will be moved automatically to the new location?
Name all HCatalog Features?
Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?
What is the man difference between hbase and hive?
What ensures load balancing of the server in Kafka?
What do you mean by metadata in HDFS? Where is it stored in Hadoop?
On which hosts does impala run?