What is Flatten?
What is your favourite tool in the Hadoop ecosystem?
Define the level of parallelism and explain why it is needed in Spark Streaming.
What are Pig properties?
Explain the uses of Pig.
How can Flume be used with HBase?
When should you choose an "Internal Table" in Hive?
What is Kundera in Cassandra?
What is the history of Apache Mahout? When did it start?
Why does MapReduce use key-value pairs to process data?
What do you understand by compute and storage nodes?
Explain the flatMap operation on an Apache Spark RDD.
What is InputFormat?
What do you mean by column family?
What happens if a block in HDFS is corrupted?