What are the various types of shared variables in Apache Spark?
Which file controls reporting in Hadoop?
How would you import data from MySQL into HDFS?
Discuss write-ahead logging in Apache Spark Streaming.
What is a transformation in Spark?
How do you use the 'foreach' operation in Pig scripts?
What is a rack awareness algorithm?
What is JMX?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
What do you understand by the term snitch in Cassandra? Give some examples.
What does the Spark engine do?
How can you use the producer API in code?
Can Spark be used without Hadoop?
What problem does Apache Flume solve?
What is an executor in Spark?