Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How do you run the pig scripts in local mode?
How does lazy evaluation work in spark?
What file systems does spark support?
Why do we need Hadoop Archives? How is it created?
What is data pipeline in spark?
Explain what happens in text format?
What are the different Data Types available in Hive?
Define compaction in HBase?
Which command do we use to show the version?
What kind of hardware is best for hadoop?
What is Streaming / Log Data?
What is hinted handoff?
What do you understand by the super column in cassandra?
What are the primary phases of a Reducer?
Can you explain worker node?