In how many ways can we create an RDD in Spark?
What happens to a NameNode that has no data?
What are the differences between Pig and MapReduce?
Which classes does Hive use to read and write HDFS files?
How can you start the Kafka server?
On which port does SSH run?
What is a commit log?
What are Pig Latin statements?
Explain the limitations of HBase.
What are the main APIs of Kafka?
How does Apache Spark work?
Can you explain how you can use Apache Spark along with Hadoop?
What is the Spark driver?
What are the various configuration parameters required to run a MapReduce job?
What are Pig's execution modes?