Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is difference between hadoop and spark?
How client application interacts with the NameNode?
What is SparkContext in Apache Spark?
What is Interceptor?
Is a log flume a roller coaster?
Explain Erasure Coding in Hadoop?
Define the difference between hive and hbase?
While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
What is apache spark and what is it used for?
What is spark master?
What is the difference between Hiveserver1 and Hiveserver2?
Explain how do ‘map’ and ‘reduce’ work?
Explain the Scope operators used in hbase?
Can rdd be shared between sparkcontexts?
What is spark good for?