Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
When is it suggested to use a combiner in a MapReduce job?
Name the scalar data type and complex data types in Pig?
How NameNode tackle Datanode failures in Hadoop?
Which among the two is preferable for the project- Hadoop MapReduce or Apache Spark?
Explain what is hadoop?
What is hotspotting in hbase?
What is the use of “void close()” method?
What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?
What is apache spark written in?
What happens when an action is executed in spark?
While starting hadoop services, datanode service is not running?
How to create table in hive for a json input file?
What are the tools that are used in ambari monitoring?
Explain the Reducer's Sort phase?
Explian the Advantages of HBase?