Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Specify the different types of tables accessible in hive?
How hive can improve performance with orc format tables?
Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?
What is JMX?
Define actions in spark.
How does apache spark engine work?
How do I change hive execution engine to spark?
How hbase handles the write failure?
What are different logging levels in cassandra?
What is SerDe in Apache Hive ?
What is a tuple?
List out the some common problems faced by data analyst?
When the reducers are are started in a mapreduce job?
How many types of Tables in Hive?
Explain Reliability and Failure Handling in Apache Flume?