Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is apache hcatalog?
Describe coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
Define a udf?
What is the difference between persist
What are the actions followed by hadoop?
Explain the term sstables?
When should you use spark cache?
What is flume and sqoop?
Explain various cluster manager in Apache Spark?
What is the default block size in Hadoop 1 and in Hadoop 2? Can it be changed?
What is partitioning key?
Compare Apache Hadoop and Apache Spark?
is HQL case sensitive?
What is the purpose of Hive Driver?
What is the use of explode in Hive?