What do you mean by compaction?
What is the difference between HBase and Hadoop/HDFS?
What is Apache Spark? What is the reason behind the evolution of this framework?
What is the use of the Context object?
What guarantees does Kafka provide?
Can you modify a file present in HDFS?
What can skew the mean?
What is the design architecture of Cassandra?
How do you set mappers and reducers for Hadoop jobs?
How is a task scheduled by the JobTracker?
What is Pig Statistics? Which stats classes are available in the Java API package?
Which relational operators for combining and splitting are available in Pig Latin?
What is the latest available version of Ambari, and what feature was added in it?
What are the key differences between Pig and MapReduce?
Which sorting algorithm is used in Hadoop MapReduce?