What is the role of the Consumer API?
What are the actions in Spark?
What are the benefits of Spark lazy evaluation?
What are the new and old MapReduce APIs used when writing a MapReduce program? Explain how they work.
What makes Apache Spark good at low-latency workloads like graph processing and machine learning?
What relational operators are available in Pig Latin for combining and splitting data?
Does Spark use YARN?
Name the three layers that Ambari supports.
How can we ensure that all values for a particular key go to the same reducer?
What is the NameNode port number?
What is a partitioner, and how can the user control which key goes to which reducer?
What are the core methods of a Reducer?
What tools are used for monitoring in Ambari?
Briefly explain the history of Hadoop.
Can you explain Spark MLlib?