How do I start a Spark master?
What are the different primitive data types available in Hive?
What is a block in HDFS, and why is the block size 64 MB?
What are znodes?
How would you check whether your NameNode is working or not?
What is HBase?
What is the difference between FileSink and FileRollSink?
What is Apache Mahout?
How does an RDD work in Spark?
What is the difference between DESCRIBE and DESCRIBE EXTENDED?
What is cqlsh?
When should you use --target-dir and when should you use --warehouse-dir while importing data?
What is Hive used for?
How do you run a file system check in HDFS?
Why can't aggregation be done in the Mapper in MapReduce?