Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What services run after running an HBase job?
What are the different modes for running Pig?
Explain JMX with respect to HBase.
What are the various libraries available on top of Apache Spark?
Name a few companies that use Apache Spark in production.
Whenever I run a Hive query from a different directory, it creates a new metastore_db. What is the reason for this?
How do I start a Flume agent?
What is shuffling in MapReduce?
Where is Spark used?
What is the unit of data that flows through a Flume agent?
Explain the use of broadcast variables
What is the Hive metastore?
How do you write a CREATE INDEX statement in Apache Tajo?
What is serialization in Spark?
What is the Catalyst framework in Spark?