Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the components of Pig Execution Environment?
How does reducebykey work in spark?
what does the text input format do?
What is action, how it process data in apache spark
Name the types of tunable consistency?
What is a DStream?
How is jmx useful in cassandra?
What is Reducer in MapReduce?
Does Flume provide 100% reliability to the data flow?
How do you write your own custom SerDe ?
Can impala do user-defined functions (udfs)?
What is the replica placement Strategy in Cassandra ?
Define replication factor?
What is the difference between external table and managed table?
Which modes can Hadoop be run in? List a few features for each mode?