Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do you copy data between clusters in HDFS?
How much is flume worth?
What is a broker?
What is a bag in Pig Latin?
What are the functions of Spark Core?
How will you calculate the number of executors required to do real-time processing using Apache Spark? What factors need to be considered for deciding on the number of nodes for real-time processing?
What is SSTable?
Explain the HCatLoader and HCatStorer APIs.
Why use Hadoop?
How can we see all the clusters that are available in Ambari?
How can I import large objects (BLOB and CLOB objects) in Apache Sqoop?
What is a pipelined RDD?
Who should learn Apache Ambari?
Define "Action" in Spark.
What does the MapReduce framework consist of?