Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the default level of parallelism in apache spark?
How to remove safemode of namenode forcefully in HDFS?
What are the benefits of Spark lazy evaluation?
What is the difference between hadoop and other data processing tools?
What are the Basics of Hadoop?
What do you know about sequencefileinputformat?
What is hfile ?
Explain the level of parallelism in spark streaming?
What are barriers?
What do you understand by node in cassandra?
Explain about the popular use cases of Apache Spark
What is the role of recordreader in hadoop mapreduce?
What is the difference between store and dumps commands?
How to create a custom key and custom value in MapReduce Job?
What are the three layers where the hadoop components are actually supported by ambari?