Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Whats is distributed cache in hadoop?
Which type of data HBase can store?
Is apache spark going to replace hadoop?
How mahout used with python ?
Can you explain clustering in mahout?
How does a client read/write data in HDFS?
Is reduce-only job possible in Hadoop MapReduce?
Can MapReduce program be written in any language other than Java?
What are the benefits of setting up a local repository?
What are shared variables in spark?
Why is Kafka technology significant to use?
Explain the top() and takeordered() operation?
Why do we use spark?
What are different hdfs dfs shell commands to perform copy operation?
Is it necessary to kill the topology while updating the running topology?