Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the importance of — the split-by clause in running parallel import tasks in sqoop?
What are main APIs of Kafka?
Name different elements of JConsole?
Explain map-only job?
What are the different tools used for the ambari monitoring purpose?
What is inputformat in hadoop?
What is the difference between DAG and Lineage?
Is HDFS utilized in Cassandra? If yes, where?
What is the use of explode in Hive?
How to perform the inter-cluster data copying work in HDFS?
Explain what is webdav in hadoop?
Explain the different logging levels in cassandra.
What is apache spark and what is it used for?
How can we create table by using command?
Illustrate a simple example of the working of MapReduce.