Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain how can we check whether namenode is working or not?
How many ways we can create rdd?
Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.
What are the modes in which Hadoop run?
What is spark vcores?
What is the purpose of JConsole?
Mention what is the difference between apache kafka and apache storm?
How to set which framework would be used to run mapreduce program?
Explain the architecture of Hadoop Pig?
Can we do real-time processing using spark sql?
Is spark faster than hadoop?
How can we see only top 15 records from the student.txt out of100 records in the HDFS directory?
Define the Use of MapReduce?
Types of Data Flow in Flume?
Query language is executed in Cassandra database. Clarify?