What is the benefit of the distributed cache? Why can't we just keep the file in HDFS and have the application read it?
What is meant by high availability of the NameNode?
How is indexing done in HDFS?
What are file permissions in HDFS, and how does HDFS check permissions for files and directories?
What is the command for archiving a group of files in HDFS?
How is data or a file written into HDFS?
What is the difference between the MapReduce engine and the HDFS cluster?
How do you create a directory in HDFS?
What is throughput? How does HDFS get a good throughput?
In HDFS, how does the NameNode determine which DataNode to write to?
How does HDFS communicate with the native Linux file system?
What is the difference between an HDFS block and an input split?
How are file systems checked in HDFS?
Explain the HDFS “write once, read many” pattern.
What are the main properties of the hdfs-site.xml file?
Can HDFS go wrong? If so, how?