Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the active and passive NameNodes in HDFS?
Is Hadoop based on Google MapReduce?
What is a rack?
What problem does Apache Pig solve?
What is a UDF in Pig?
What do you understand by the super column in Cassandra?
What roles do Replicas and the ISR play?
What is a Bloom filter used for in Cassandra?
How does Pig work?
How do you write output to a file using Storm?
Can I do transforms or add new functionality?
What is SparkSession in Apache Spark, and why is it needed?
What are the network requirements for using Hadoop?
How is a message consumed by a consumer in Kafka?
What are the steps involved in the MapReduce framework?