Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Hive on Spark?
What are the three types of tombstone markers in HBase?
What is a "worker node"?
Can I do transforms or add new functionality?
When should you use Spark cache?
Why should we use 'filters' in Pig scripts?
What are the three layers where the Hadoop components are actually supported by Ambari?
What is Flatten?
What are the features and characteristics of Apache Spark?
How do you submit extra files (JARs, static files) for a Hadoop MapReduce job at runtime?
What are tokens in Cassandra?
Explain Spark SQL caching and uncaching.
What are barriers?
What happens in text format?
Why is there a need for broadcast variables when working with Apache Spark?