Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Apache Spark good for?
What are the advantages of Spark over MapReduce?
Explain Cassandra.
Can I set the number of reducers to zero?
Explain the future growth of Apache Ambari.
Where does the name Sqoop come from?
What is the HAVING clause in Apache Tajo?
List commonly used machine learning algorithms.
What is Spark?
What can skew the mean?
How do you create a custom key and a custom value in a MapReduce job?
Which Java class handles encoding output records into the files that result from Hive queries?
Is a JDBC driver enough to connect Sqoop to databases?
What is InputFormat in Hadoop MapReduce?
What is the difference between map and flatMap?