Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?
What is ILLUSTRATE used for in Apache Pig?
What is a Spark shuffle?
What is ISR?
What is the difference between ORDER BY and SORT BY in Hive?
Can the number of combiners be changed in MapReduce?
Is it necessary to install Spark on all the nodes of a YARN cluster when running Apache Spark on YARN?
What is the difference between MapReduce and Spark?
What is executor memory in a Spark application?
What is a partition in Hive?
Name the commonly used Spark ecosystem components.
What happens when two clients try to access the same file on HDFS?
What are the features of pseudo-distributed mode?
What is COUNT_STAR?
What is HDFS (Hadoop Distributed File System)?