Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How many job tracker processes can run on a single Hadoop cluster?
Where is the Mapper Output intermediate kay-value data stored ?
Can you explain the term, Cassandra?
What daemons run on master nodes?
What is column store db? Explain with an example.
Explain about the bloommapfile?
Is ambari python client can be used to make good use of ambari api’s?
What is Safemode in Apache Hadoop?
How can an application connect to Hive run as a server?
What is difference between regular file system and HDFS?
Explain when using field grouping in storm, is there any time-out or limit to known field values?
RLIKE in Hive?
Why is spark good?
Whats is distributed cache in hadoop?
How does apache spark engine work?