Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Why do we use HDFS for applications having large data sets and not when there are lot of small files?
1 2768
What is namenode?
What are configuration files in Hadoop?
Explain HDFS “Write once Read many” pattern?
What are the advantages of using mapreduce with hadoop?
What is the full form of fsck?
Explain Spark join() operation?
What is a Backup node in Hadoop?
Can you explain spark core?
Explain mappartitions() and mappartitionswithindex()?
Explain how do ‘map’ and ‘reduce’ works?
Explain the Scope operators used in hbase?
What are different types of filesystem?
What are the destination types allowed in sqoop import command?
What are the independent extensions that contributed to the ambari codebase?
What do you mean by the High Availability of a NameNode in Hadoop HDFS?