Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How is hadoop different from other data processing tools?
how you can reduce churn in ISR? When does broker leave the ISR?
Explain what is the row key?
Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?
Explain about Hadoop file system and processing framework?
what is the difference between order by and sort by in Hive?
State about ZooKeeper WebUI?
Name some independent extensions that contribute to the Ambari codebase?
List of the some best tools that can be useful for data-analysis?
Is hive similar to sql?
Some of the most notable applications of Kafka?
What is spark used for?
Define the management tools in Cassandra?
Would you be able to change the block size of hdfs files?
Differentiate between static and dynamic cql tables.