Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do you copy data between clusters in HDFS?
What are the four essential parameters of a mapper?
What is a NameNode?
What costs are associated with creating an index on Hive tables?
What is Apache Spark Streaming?
What is a DataNode?
On what basis can you differentiate RDD, DataFrame, and Dataset?
What are the disadvantages of using Spark?
What is the difference between map and flatMap?
Explain the general MapReduce algorithm.
What are the prerequisites for learning Avro?
What are the main hdfs-site.xml properties?
Does Cassandra support ACID transactions?
How do you optimize a Hadoop MapReduce job?
Does Spark work with Python 3?