How can you get exactly-once messaging from Kafka during data production?
How many types of NoSQL databases are there?
Explain what the NameNode is in Hadoop.
What is the use of InputFormat in the MapReduce process?
What is off-heap memory in Spark?
Who uses Apache Spark?
What is logistic regression?
Does Spark use Java?
Compare Hadoop and Spark.
Explain how HDFS communicates with the Linux native file system.
When should we use SORT BY instead of ORDER BY?
How can we kill a topology?
Explain SchemaRDD.
What is the flatMap transformation in Apache Spark RDD?
Why is Spark faster than Hive?