Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Say when to pick “inward table” and “outside table” in hive?
Explain how to Tune Kafka for Optimal Performance?
Does spark use yarn?
Explain what is distributed cache in mapreduce framework?
What is key-value store db?
What is the logical plan in pig architecture?
Is hadoop still in demand?
How is security achieved in Apache Hadoop?
What do sorting and shuffling do?
List some use cases where classification machine learning algorithms can be used.
What is Hadoop streaming?
What does repartition do in spark?
what does the conf.setMapper Class do ?
Explain the Single point of Failure in Hadoop?
What is tunable consistency in Cassandra?