Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is lazy evaluation and how is it useful?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
What is the difference between map and reduce?
Hadoop Libraries and Utilities and Miscellaneous Hadoop Applications?
What is the importance of — the split-by clause in running parallel import tasks in sqoop?
Can spark work without hadoop?
What is org.apache.jute package?
How can one copy a file into HDFS with a different block size to that of existing block size configuration?
What is the maximum size of string data type supported by hive? Mention the hive support binary formats.
How much Metadata will be created on NameNode in Hadoop?
What is a bag in apache pig?
What are input format, input split & record reader and what they do?
Name a few import control commands. How can Sqoop handle large objects?
What are the stable versions of Hadoop?
How many instances of tasktracker run on a hadoop cluster?