Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
If you run a select * query in hive, why does it not run mapreduce?
Explain the filter transformation?
Why do we need rdd in spark?
What is a UDF in Pig?
What is the difference between Job and Task in MapReduce?
Is there any benefit of learning MapReduce, then?
What is the role of the zookeeper?
What is Spark Dataset?
How is spark sql different from hql and sql?
Which command do we use to show the version?
Does apache flume support third-party plugins?
What is the role zookeeper plays in a cluster of kafka?
Is hadoop the future?
Explain tokenize?
What is scala spark?