Apache HDFS Hadoop Distributed File System Interview Questions
Questions Answers Views Company eMail

What is the problem in having lots of small files in hdfs?

38

Why rack awareness algorithm is used in hadoop?

30

Can you change the block size of hdfs files?

36

Explain about the indexing process in hdfs?

64

Explain what is difference between an input split and hdfs block?

53

Replication causes data redundancy then why is pursued in hdfs?

41

Since the data is replicated thrice in hdfs, does it mean that any calculation done on one node will also be replicated on the other two?

31

If a particular file is 50 mb, will the hdfs block still consume 64 mb as the default size?

28

What is the difference between an hdfs block and input split?

66

Explain hdfs?

48

Explain the key features of hdfs?

22

How does hdfs get a good throughput?

52

Explain how indexing is done in hdfs?

32

Explain the difference between mapreduce engine and hdfs cluster?

61

Explain the difference between an hdfs block and input split?

55


Post New Apache HDFS Hadoop Distributed File System Questions

Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

What is a Block Scanner in HDFS?

85


Difference Between Hadoop and HDFS?

55


Can multiple clients write into an HDFS file concurrently in hadoop?

58


Does hdfs enable a customer to peruse a record, which is already opened for writing?

26


How does hdfs give great throughput?

28


Explain what is difference between an input split and hdfs block?

53


What is a rack awareness algorithm and why is it used in hadoop?

27


Would you be able to change the block size of hdfs files?

39


How to create directory in HDFS?

44


Explain the difference between an hdfs block and input split?

55


How to keep files in HDFS?

49


What are problems with small files and hdfs?

30


What do you mean by the high availability of a namenode?

22


Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?

35


Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?

47