Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you mean by data locality?
How does Spark handle monitoring and logging in Standalone mode?
Name the three layers that Ambari supports.
What is the difference between Hive and Spark?
Why is HBase a schema-less database?
What is the small file problem in Hadoop? How can it be resolved?
How does Facebook use Hadoop, Hive, and HBase?
How many datanodes can run on a single Hadoop cluster?
What is parallelize in Spark?
Explain the HCatStorer APIs.
What is vectorized query execution?
Can you explain speculative execution?
How can we see all the clusters that are available in Ambari?
What is the default replication factor in HDFS?
Why are we using Flume?