What are the configuration files in Hadoop?
What are the benefits yarn brings in to hadoop?
what is next step after mapper or maptask?
What is the problem with small files in Hadoop?
Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
What is namenode?
Where are Hadoop’s configuration files located?
Define a udf?
Is Namenode machine same as DataNode machine as in terms of hardware in Hadoop?
Why Hadoop performs replication, although it results in data redundancy?
What is a record reader?
Why is Data Block size set to 128 MB in Hadoop?
Can you define udf?
Clarify how job tracker schedules an assignment?
According to IBM, what are the three characteristics of Big Data?