Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Command to format the NameNode?
Explain a common use case for Flume?
Can hbase run without hadoop?
Which one is the master node in HDFS? Can it be commodity hardware?
What are the benefits of NoSQL over relational database?
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
Why Ambari?
What is the use of checkpoints in spark?
Clarify how hive de-serialize and serialize the information?
How hbase handles the write failure?
What are the core methods of a Reducer?
Does spark replace hadoop?
Is there an easy way to expire a session for testing?
Why we use BloomMapFile?
What is a flume agent?