What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
40Post New Apache HDFS Hadoop Distributed File System Questions
Since the data is replicated thrice in hdfs, does it mean that any calculation done on one node will also be replicated on the other two?
Explain the hdfs architecture and list the various hdfs daemons in hdfs cluster?
What is Hadoop Distributed File System- HDFS?
What are the main hdfs-site.xml properties?
What is a task tracker?
How does hdfs get a good throughput?
How NameNode tackle Datanode failures in HDFS?
How can one copy a file into HDFS with a different block size to that of existing block size configuration?
What should be the HDFS Block size to get maximum performance from Hadoop cluster?
How data or a file is written into hdfs?
Explain what is difference between an input split and hdfs block?
What is NameNode and DataNode in HDFS?
Why is block size set to 128 MB in HDFS?
Explain how indexing in hdfs is done?
What is the importance of dfs.namenode.name.dir in HDFS?