Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a bag in apache pig?
How does job tracker schedule a job for the task tracker?
What are the differences between a node, a cluster, and datacenter in Cassandra?
What are the characteristics of hadoop framework?
How to insert records in apache tajo?
Explain count_star?
What is kafka technology?
What is the roadmap for apache mahout version 1.0?
How can you remove the elements with a key present in any other RDD?
Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then a review of a restaurant states "Good food, bad service," your score might be 1 - 1 = 0.
Explain the difference between gen1 and gen2 hadoop with regards to the namenode?
What is cluster in Cassandra?
What are the log files of the presto server?
How to sort intermediate output based on values in MapReduce?
Explain how does hbase actually delete a row?