What is meant by a columnar storage format?
Can you explain data versioning?
What does JMX stand for?
Why does HDFS store data on commodity hardware despite the higher chance of failures?
What are the primary phases of a Reducer?
Explain ZooKeeper leader election.
Define a pair RDD in Apache Spark.
State the syntax of the command used to drop a partition.
What are the operating systems supported by Apache Ambari?
How can we create a Hadoop cluster from scratch?
Can you explain the indexing process in HDFS?
What does the Connector API do in Kafka?
Explain how you can debug Hadoop code.
What is meant by stream processing in Kafka?
Mention the key components of the Hive architecture.