What is BookKeeper?
Cassandra is written in which language?
Why do we use BloomMapFile?
Why Flume?
Explain transformations on an RDD. How does lazy evaluation help reduce the complexity of the system?
Describe the phases of the Reducer in detail.
What can I do with my M&S Sparks points?
What is a bag?
What does the AdminClient API do in Kafka?
Is Hadoop based on Google MapReduce?
Where is Spark used?
How will you design or modify a schema in HBase programmatically?
What does the Spark Engine do?
While installing, why does Apache have three config files - srm.conf, access.conf and httpd.conf?
How do we represent data in Spark?