Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Replication Factor in Cassandra?
What are the the issues associated with the map and reduce slots based mechanism in mapReduce?
Why do we need sparkcontext?
How can you configure remote metastore mode in Hive?
What are the different types of partitioners in cassandra? Explain.
What are the main features and Characteristics of Hadoop which makes it the most popular and powerful Big Data tool?
What are brokers in kafka?
How does gossip protocol help in failure detection?
Can rdd be shared between sparkcontexts?
What are the different modes in which we can configure/install Hadoop?
What is write ahead log(journaling)?
What are the various programming languages supported by Spark?
what are the different modes of Hive?
What is CONCATENATE command in Hive?
Explain the difference between nas and hdfs?