Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the components of Hive architecture?
What can be optimum value for Reducer?
What are the benefits of NoSQL over relational database?
What is the difference between TextinputFormat and KeyValueTextInputFormat class?
What does rack awareness mean?
What is NoSQL database?
What are the benefits yarn brings in to hadoop?
Wherever (Different Directory) I run the hive query, it creates new metastore_db, please explain the reason for it?
What are the advantages of DataFrame?
What do you mean by inputformat?
Give the differences between the different types of primary keys in cassandra?
What is interactive mode in apache pig?
What is paired rdd in spark?
What are tools available to send the streaming data to hdfs?
Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?