Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a worker node?
What problem does Apache Pig solve?
Is the keyword ‘DEFINE’ like a function name?
How can you store data in Spark?
Describe the coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
How do you start HBase services?
What is a bookie in BookKeeper?
What is the process for starting a Kafka server?
What is the Apache Spark engine?
Explain the replicating and multiplexing selectors in Flume.
How many reducers run in a MapReduce job?
What does high availability of a NameNode mean? How is it accomplished?
What is meant by NameNode High Availability in Hadoop?
What is a Combiner in MapReduce?
What are Spark vcores?