Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What daemons run on master nodes?
What is column store db? Explain with an example.
Does spark use tez?
What are the functionalities of jobtracker?
What happen on the namenode when a client tries to read a data file?
What is dag – directed acyclic graph?
Is it possible to create multiple table in hive for same data?
Difference between order by and sort by in Hive?
Does google use spark?
What do you understand by unit and ()in scala?
How is rdd distributed?
Is there any benefit of learning mapreduce if spark is better than mapreduce?
Which serialization libraries are supported in spark?
What is key-value store db?
How can we create children / sub-znode?