Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you mean by tunable consistency?
What is the ZooKeeper ensemble?
How the Client communicates with HDFS?
What is Data Locality in Hadoop?
What are the common mistakes developers make when running Spark applications?
How does lazy evaluation work in spark?
What is available mechanism for connecting from applications, when we run hive as a server?
Can hbase run without hadoop?
How many types of nosql databases?
How will format the HDFS ?
Explain map-only job?
How to set the number of reducers?
What is the default extension of the files produced from a sqoop import using the –compress parameter?
What is hdfs spark?
What is Rack Awareness? What is its need in Hadoop?