What should be the ideal replication factor in Hadoop Cluster?
Answers were Sorted based on User's Feedback
Define streaming access?
What Mapper does?
How to change Replication Factor For below cases ?
What are the default configuration files that are used in hadoop?
Which data storage components are used by hadoop?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
What is the default replication factor?
Which one is default InputFormat in Hadoop ?
What is a checkpoint?
explain Metadata in Namenode?
What's the best way to copy files between HDFS clusters?
What is the communication channel between client and namenode/datanode?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)