How is hadoop different from other data processing tools?
No Answer is Posted For this Question
Be the First to Post Answer
Why do we use HDFS for applications having large data sets and not when there are lot of small files?
Explain what if rack 2 and datanode fails?
Can you explain how do ‘map’ and ‘reduce’ work?
How does a namenode handle the failure of the data nodes?
Does this lead to security issues?
Can the balancer be run while Hadoop is in use?
What is the difference between HDFS and NAS ?
What is the process to change the files at arbitrary locations in HDFS?
What is zookeeper in hadoop?
What is Hadoop Custom partitioner ?
How Mapper is instantiated in a running job?
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)