Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is job tracker in Hadoop?
If you run hive as a server, what are the available mechanism for connecting it from application?
What is an "RDD Lineage"?
How to enable trash/recycle bin in hadoop?
Can we run Apache Spark without Hadoop?
Can we submit the mapreduce job from slave node?
Explain reduceByKey() Spark operation?
What is hdfs block size?
Explain keys() operation in Apache spark?
Give any two features of flume?
What is spark sqlcontext?
What are the hadoop's three configuration files?
Differentiate between the physical plan and logical plan in Pig script?
What is data skew and how do you fix it?
How should 'load' keyword is useful in pig scripts?