Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark driver application?
Mention some machine learning algorithms exposed by mahout?
Which companies are mostly using Hive ?
Can you define rdd?
What counter in Hadoop MapReduce?
What are Actions?
What Is Difference Between Mapreduce and Pig ?
What are znodes?
if you run Hive as a server?
Is fs.mapr.working.dir a single directory?
What are the config properties of presto?
What do you mean by a bag in Pig?
How to enable/configure the compression of map output data in hadoop?
How do you write your own custom SerDe ?
Which modes can Hadoop be run in? List a few features for each mode?