Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How does one create RDDs in Spark?
Use of Codegen command in Hadoop sqoop?
What does ambari shell can provide?
What is a databricks cluster?
Is JDBC driver enough to connect sqoop to the databases?
How to change a number of mappers running on a slave in MapReduce?
Which one is the master node in HDFS? Can it be commodity hardware?
Mention what is the benefits of apache kafka over the traditional technique?
What is off heap memory in spark?
Does spark use hive?
What is the difference between TextInputFormat and KeyValueInputFormat class?
Features of Kafka Stream?
How to debug Hadoop code?
What are core components of Flume?
What do you understand by worker node?