Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
If a data Node is full how it's identified?
What are the design goals of zookeeper?
How to Delete directory and files recursively from HDFS?
Explain what is difference between an input split and hdfs block?
If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?
What is ZooKeeper quorum?
What is spark yarn executor memoryoverhead?
How does hdfs give great throughput?
What is the meaning of speculative execution in Hadoop? Why is it important?
What is ZooKeeper?
What is the InputFormat ?
Where do you specify the Mapper Implementation?
What is RDD Lineage?
What do you understand about yarn?
How Mapper is instantiated in a running job?