Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a generic udf in hive?
How does data transfer happen from hdfs to hive?
Explain Tombstone in Cassandra?
Can you explain heartbeat in hdfs?
Differentiate between the various types of primary keys in cassandra.
What is Sparse Vector?
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
How does rdd work in spark?
List the benefits of Spark over MapReduce.
why use hcolumndescriptor class?
Explain about the data model operations in HBase?
How does spark work with python?
Describe Network Topology Strategy?
What is the best practice on deciding the number of column families for HBase table?
What is row in hbase?