Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Does the hdfs client decide the input split or namenode?
Can you explain logistic regression?
List the various options available with the Hive command?
What is spark training?
How can you see the list of stored jobs in sqoop metastore?
What is reduce side join in mapreduce?
Why comparison of types is important for MapReduce?
Can we say cogroup is a group of more than 1 data set?
What is the use of spark sql?
Does hadoop follows the unix pattern?
What is sink processors?
What is apache hcatalog?
What is SSTable? How is it different from other relational tables?
Discuss the various running mode of Apache Spark?
What is secondary namenode? Is it a substitute or back up node for the namenode?