Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the difference between Spark SQL and Hive.
What is PluckTuple?
What platform and Java version are required to run Hadoop?
What is the role of the Streams API?
What does /etc/init.d do?
How is streaming implemented in Spark?
Why can aggregation not be done in Mapper in MapReduce?
Is MapReduce required for Impala? Will Impala continue to work as expected if MapReduce is stopped?
What is a SchemaRDD?
If no custom partitioner is defined in Hadoop, how is data partitioned before it is sent to the reducer?
What are the various input and output types supported by MapReduce?
Can the NameNode and DataNode run on commodity hardware?
What are the core methods of the Reducer?
Can we change a file cached by the distributed cache?
How does Apache Spark work?