Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between a node, a cluster, and data centre?
what do you mean by compaction?
What is Yum?
Compare HBase vs RDBMS?
Compare RDBMS with Hadoop MapReduce.
Explain the lookup() operation in Spark?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?
What are the disadvantages of using Spark?
What happens if the block in HDFS is corrupted?
How does spark rdd work?
Define RDD?
Explain Dsstream with reference to Apache Spark
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
How do I download apache mahout?
What do you understand by Executor Memory in a Spark application?