Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the default maximum dynamic partition that can be created by a mapper reducer? How can you change it?
Clarify about the smb join in hive?
How to create custom key and custom value in MapReduce Job?
Can hadoop handle streaming data?
Why do we need Pig?
What are 'slaves' and 'masters' in Hadoop?
Explain small file problem in hadoop
Explain briefly what is Action in Apache Spark? How is final result generated using an action?
What are active and passive "NameNodes"?
What are the four basic parameters of a reducer?
What is ganglia is used for in ambari?
how Cassandra delete Data?
Explain the difference between Spark SQL and Hive.
What are some typical functions of Job Tracker?
Is spark built on top of hadoop?