What is the use of checkpoints in Spark?
How do you use the HDFS put command to transfer data from Flume to HDFS?
Explain how Cassandra performs a read operation.
How do you write a custom key class?
What is Kafka in Hadoop?
On what basis does the NameNode distribute blocks across the DataNodes in HDFS?
What is the difference between Apache Kafka and Apache Storm?
What are the various libraries available on top of Apache Spark?
What do you understand by an inner bag and an outer bag in Pig?
What are the differences between a node, a cluster, and a data center in Cassandra?
What is a column family in Cassandra?
Explain how the JobTracker schedules a task.
Why is Apache Spark so fast?
What happens when a DataNode fails?
What is the use of a DataFrame in Spark?