Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between Pig Latin and HiveQL?
What are a block and a block scanner in HDFS?
How do you create users in Hadoop HDFS?
What is a UDF?
How is ordering done in HDFS?
Which query language is used in the Cassandra database?
What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
What are ‘maps’ and ‘reduces’?
What is the abstraction of Spark Streaming?
What is ZooKeeper in Kafka? Can we use Kafka without ZooKeeper?
How do you create a custom key and a custom value in a MapReduce job?
Why is replication critical in Kafka?
What are Flume and Sqoop?
What are the benefits of Spark over MapReduce?
What is data cleansing?