Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain Hadoop streaming?
What port does Spark use?
When is it not recommended to use MapReduce paradigm for large
What problem does Apache Flume solve?
What causes sparks?
What is the use of map transformation?
What is the functionality of the Query Processor in Apache Hive?
How do you format HDFS? How frequently is it done?
Explain the wordcount implementation in the Hadoop framework?
Replication causes data redundancy, so why is it pursued in HDFS?
What is a generic UDF in Hive?
How does Cassandra write changed data into the commit log?
Is bigger than spark.driver.maxResultSize?
How is recovery achieved in Ambari?
When are the reducers started in a MapReduce job?