Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the two types of compactions in Cassandra.
In hadoop_pid_dir, what does pid stands for?
How can you remove the elements with a key present in any other RDD?
What is a bloom filter?
Does spark load all data in memory?
Can you explain benefits of spark over mapreduce?
What is difference between spark and kafka?
List some benefits of apache kafka?
what is (HS2) HiveServer2?
What are clusters in cassandra?
How do you write your own custom SerDe ?
What is the difference between Hadoop and Traditional RDBMS?
What is the difference between HDFS block and input split?
Is kafka a message queue?
What is flatmap in apache spark?