Hadoop General Interview Questions
Questions Answers Views Company eMail

If datanodes increase, then do we need to upgrade namenode?

337

Which object can be used to get the progress of a particular job

334

Give me examples of unstructured data?

318

Explain the core methods of the reducer?

355

How to handle bad records during parsing?

320

What is the use of context object?

317

What happens if number of reducers are 0?

342

What are the primary phases of the reducer?

318

what is next step after mapper or maptask?

305

What is a rack?

304

Do we require two servers for the namenode and the datanodes?

340

We have already sql then why nosql?

303

What is difference between reducer and combiner?

340

What do you understand by standalone (or local) mode?

292

What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?

316


Post New Hadoop General Questions

Un-Answered Questions { Hadoop General }

What is meant by streaming access?

360


After increasing the replication level, I still see that data is under replicated. What could be wrong?

342


Are there any special requirements for namenode?

300


If no custom partitioner is defined in Hadoop then how is data partitioned before it is sent to the reducer?

313


What is a commodity hardware? Does commodity hardware include RAM?

333


What is difference between reducer and combiner?

340


What is a single point of failure in Hadoop 1 and how is it resolved in Hadoop 2?

356


What does hadoop-metrics.properties file do?

343


What does rack awareness algorithm means?

324


What does hadoop-env.sh do?

310


What is the non dfs used?

380


How does job tracker schedule a job for the task tracker?

340


When and how to create hadoop archive?

316


What does block mean?

320


Which directory does hadoop install to?

360