Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can multiple clients write to an HDFS file concurrently in Hadoop?
What is the latest available version of Ambari, and what features were added in it?
Name some Avro reference APIs.
What is pseudo-distributed mode?
What is the use of InputFormat in the MapReduce process?
Why does Hive not store metadata information in HDFS?
Name the different types of NoSQL databases.
Explain the TOBAG function.
Do we need Hadoop for Spark?
How can Flume be used with HBase?
What is the process for changing the split size if there is limited storage space on commodity hardware?
What are the three modes in which Hadoop can be run?
Why do we use ‘filters’ in Pig scripts?
How does the Mapper's run() method work?
Explain the partitioning, shuffle, and sort phases.