Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define Cluster?
What happen if number of reducer is set to 0 in Hadoop?
What is Data Log in Kafka?
How to write a custom partitioner for a Hadoop MapReduce job?
How can we create znodes?
What is write ahead log(journaling) in Spark?
What is mlib in apache spark?
What is Streaming / Log Data?
How many daemon processes run on a hadoop cluster?
Is hadoop the future?
How should 'load' keyword is useful in pig scripts?
What is the problem with the small file in Hadoop?
Explain about the different channel types in Flume. Which channel type is faster?
What are the consistency levels for read operations in Cassandra?
What are file permissions in HDFS and how HDFS check permissions for files or directory?