Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a namenode in hadoop?
What are durable writes?
Define catalog tables in HBase?
What do you understand by the super column in cassandra?
How we can take Hadoop out of Safe Mode?
What are the most memory-intensive operations?
What are the uses and applications of mahout ?
Explain countByValue() operation in Apache Spark RDD?
Define a metadata?
What do slaves consist of?
How do users interact with HDFS in Apache Pig ?
What is the role of recordreader in hadoop mapreduce?
What is faster than apache spark?
How is a keyspace created in cassandra? & What are the parameters used?
What is partitioning key?