Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)

What is Thrift in Apache Cassandra?
What is ZooKeeper?
What are the advantages of using a map-side join in MapReduce?
What is a DStream in Spark?
What is the difference between cache and persist in Spark?
Explain the process to trigger automatic clean-up in Spark to manage accumulated metadata.
What are the management tools in Cassandra?
Why did Spark come into existence?
What is the use of the “ResultSet execute(Statement statement)” method?
What does the mapred.job.tracker property do?
How will you back up an HBase cluster?
What is Hive?
What is shuffling and sorting in MapReduce?
Is an RDD type-safe?
What is Spark accreditation?