Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)

What are the disadvantages of Apache Kafka?
How is security achieved in Apache Hadoop?
What does the rack awareness algorithm mean, and why is it used in Hadoop?
Can I set the number of reducers to zero?
How can we create child znodes (sub-znodes)?
How do you override the replication factor?
What is coalesce in Spark?
Give an example of a document database.
Which data storage components are used by Hadoop?
How is streaming implemented in Spark?
What is a tuple?
What do you understand by MapReduce?
Which port does SSH use?
Explain the Cassandra data model.
Can you explain Spark GraphX?