Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Would you be able to change the block size of hdfs files?
Does spark require hadoop?
What are the types of cluster managers in spark?
What is the bag?
What purpose would an engineer use spark?
Difference between hive and impala?
Define Simple Strategy?
Clarify how hive de-serialize and serialize the information?
When a large data set is maintained?
Write a Pig UDF Example ?
What are the main components of MapReduce Job?
Difference between cassandra and mongodb?
Explain the role of offset in kafka?
What is the role of CQLSH?
How to use Hive using the command line and Beeline?