What type of data should we put in the distributed cache? When should we put data in the distributed cache? How much data should we put in it?
Hadoop General Questions
What is meant by streaming access?
After increasing the replication level, I still see that data is under-replicated. What could be wrong?
Are there any special requirements for the NameNode?
If no custom partitioner is defined in Hadoop, how is data partitioned before it is sent to the reducer?
What is commodity hardware? Does commodity hardware include RAM?
What is the difference between a reducer and a combiner?
What is a single point of failure in Hadoop 1 and how is it resolved in Hadoop 2?
What does the hadoop-metrics.properties file do?
What does the rack awareness algorithm mean?
What does hadoop-env.sh do?
What does "Non DFS Used" mean?
How does the JobTracker schedule a job for the TaskTracker?
When and how should a Hadoop archive be created?
What does "block" mean?
Which directory does Hadoop install to?