Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is Spark built on top of Hadoop?
What is the meaning of compaction in HBase?
Describe JMX.
What is the significance of the ‘IF EXISTS’ clause when dropping a table?
By default, how many partitions are created in an RDD in Apache Spark?
What are the basic parameters of a mapper?
According to IBM, what are the three characteristics of Big Data?
What is the difference between Spark and Python?
Name the types of cluster managers in Spark.
How can you reduce churn in the ISR? When does a broker leave the ISR?
Explain the different execution modes available in Pig.
What do you understand by the term snitch in Cassandra? Give some examples.
What is a Backup node in Hadoop?
What is the difference between an RDD and a DataFrame in Spark?
Should we use RAID in Hadoop or not?