Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the task of Spark Engine
Describe Spark SQL?
What is Balancer in Hadoop?
How can an application connect to Hive run as a server?
How does pig work?
What is fsck?
Did you ever ran into a lop sided job that resulted in out of memory error
Differentiate between describe and describe extended?
What is partitioner spark?
What is the difference between NAS and HDFS?
What is map in spark?
How can we create znodes?
Explain what is zookeeper in kafka? Can we use kafka without zookeeper?
Clarify what is sequence file input format?
What is check pointing in hadoop?