Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is standalone mode in spark?
What types of costs are associated with creating the index on hive tables?
Explain the functionalities of ganglia in ambari?
Mention some instances where zookeeper is using?
Why is Kafka technology significant to use?
What languages support spark?
Can you explain spark core?
What is apache flume used for?
can you explain about configuration files?
State some advantages of impala?
What is the problem with HDFS and streaming data like logs
List the various HDFS daemons in HDFS cluster?
What is Cassandra Database Software ?
What does a 'MapReduce Partitioner' do?
How to specify more than one path for storage in Hadoop?