Ideally what should be the replication factor in hadoop?
No Answer is Posted For this Question
Be the First to Post Answer
Mention what is distributed cache in hadoop?
Why use hadoop?
What are the features of Fully-Distributed mode?
What do you understand by standalone (or local) mode?
If no custom partitioner is defined in Hadoop then how is data partitioned before it is sent to the reducer?
How is the splitting of file invoked in Hadoop framework?
Are job tracker and task trackers present in separate machines?
Can you explain record reader?
How can one write custom record reader?
What are the core components of Hadoop?
What happens when two clients try to access the same file in the hdfs?
Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)