Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between Gen1 and Gen2 Hadoop with regard to the NameNode?
What is the maximum size of a message that a Kafka server can receive?
What is Azure Spark?
Is it possible to provide multiple inputs to Hadoop? If yes, how?
Is the output of the mapper or the output of the partitioner written to local disk?
Is it possible to join on multiple fields in Pig scripts?
How can we remove a znode?
What is the role of the offset?
What are the different components of a Hive architecture?
What are Pig scripts?
What role does a worker node play in an Apache Spark cluster? And why does a worker node need to register with the driver program?
How does a client application interact with the NameNode?
How are sparks created?
Explain BagToTuple.
How can you schedule a Sqoop job using Oozie?