Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the modules that constitute the Apache Hadoop 2.0 framework?
What causes sparks?
Does kafka use hdfs?
How can you compare Hadoop and Spark in terms of ease of use?
Did you ever ran into a lop sided job that resulted in out of memory error
What is the use of map transformation?
What will be the output of cast ('XYZ' as INT)?
Can you explain edge nodes in hadoop?
What is full form of rdd?
What are the features of Standalone (local) mode?
What do you know about Partition in Kafka?
Can we run unix shell commands from hive? Can hive queries be executed from script files? How? Give an example.
When is it suggested to use a combiner in a MapReduce job?
What is a Hive variable? What for we use it?
How does MapReduce framework view its input internally?