Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the full form of fsck?
Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do?
What will happen in case you have not issued the command?
How to write a Custom Key Class?
What MapReduce framework consists of?
Explain HCatLoader APIs?
What do we mean by Paraquet?
How will you list all the columns of a table using Apache Sqoop?
What are the different components of a Hive query processor?
What is a task tracker?
Explain how do ‘map’ and ‘reduce’ works?
Can you define yarn?
What is flatten in pig?
How many daemon processes run on a hadoop cluster?
How is RDD in Apache Spark different from Distributed Storage Management?