Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the data model and the hierarchical namespace?
What is TaskTracker?
What causes sparks?
What are the three modes in which Hadoop can be run?
What are the functionalities of the JobTracker?
Can we use Windows for Hadoop?
In case of embedded Hive, can the same metastore be used by multiple users?
What are the applications of Presto?
What is the rack awareness algorithm, and why is it used in Hadoop?
What are the port numbers of the NameNode?
What are the binary storage formats supported in Hive?
Which HBase table design approach would you recommend: tall-narrow or flat-wide?
How does the pipe operation write the result to standard output in Apache Spark?
What is the function of NodeManager?
What is a replication strategy?