Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is combiner aggregator?
What is the problem with the small file in Hadoop?
When you point a partition of a hive table to a new directory, what happens to the data?
What do you understand about yarn?
Explain why to use hbase?
What is the Repository?
Name the filter which accepts the page size as the parameter in hbase?
What does apache spark do?
What is 'jps'?
Why do we need hdfs?
Which companies are mostly using Hive ?
How does hbase actually delete a row?
What is a Sparse Vector?
what is the traditional method of message transfer?
How does hadoop achieve fault tolerance?