Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are features of apache spark?
What are the advantages of pig language?
What is document store db? Explain with an example.
How does Cassandra perform write function?
What is the difference between kafka and flume?
What exactly kafka does?
Why is BlinkDB used?
how can you check whether Namenode is working beside using the jps command?
Can you explain smb join in hive?
Which is the best hadoop certification?
What is HDFS High Availability?
Define replication strategy?
How does hdfs ensure information integrity of data blocks squares kept in hdfs?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
Can flume provide 100% reliability to the data flow?