Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do you define a partitioning key?
What is Hadoop serialization?
Explain what is distributed cache in mapreduce framework?
How to load data into table created in hive ?
Does Apache Sqoop have a default database?
What kinds of impala queries or data are best suited for hbase?
What is the definition of Hive?
What is the required action you need to perform if you opt for scheduled maintenance on the cluster nodes?
What is configuration of a typical slave node on Hadoop cluster? How many JVMs run on a slave node?
Who is intended audience to learn HCatalog?
What are "coordinator nodes" in cassandra?
What is Spark?
What is the full form of MSLAB?
What is the jobtracker and what it performs in a hadoop cluster?
Explain what combiners are and when you should use a combiner in a mapreduce job?