Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the function of consistency cqlsh command in Cassandra?
Define Partitions?
JMX stands for?
Which is better hadoop or spark?
What is ganglia is used for in ambari?
What is spark in big data?
What is a partitioner and how the user can control which key will go to which reducer?
What problem does Apache Flume solve?
What is a map side join?
What are ‘maps’ and ‘reduces’?
What is difference between hive and spark?
Discuss about the different tombstone markers used for deletion purposes in HBase.?
What is Thrift?
Explain what happens if you alter the block size of a column family on an already occupied database?
Wherever (Different Directory) I run the hive query, it creates new metastore_db, please explain the reason for it?