Big Data Interview Questions
Questions Answers Views Company eMail

Explain how does hadoop classpath plays a vital role in stopping or starting in hadoop daemons?

409

Explain what is storage and compute nodes?

268

Explain how jobtracker schedules a task?

255

Explain what happens when hadoop spawned 50 tasks for a job and one of the task failed?

258

What happens in text format?

238

Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?

266

Explain what is a task tracker in hadoop?

223

Explain what is speculative execution?

219

Explain what happens in text format?

304

Mention what is distributed cache in hadoop?

233

Explain how can you debug hadoop code?

247

Explain how can you check whether namenode is working beside using the jps command?

250

Explain what is webdav in hadoop?

279

Mention what job does the conf class do?

239

For using hadoop list the network requirements?

221


Un-Answered Questions { Big Data }

What is spark table?

158


What are the configuration files in Hadoop?

249


When is it not recommended to use MapReduce paradigm for large scale data processing?

391


What is NoSQL?

648


Is hadoop a database?

407






Differentiate between the terms: node, a cluster, and data center in cassandra?

59


Which all languages Apache Spark supports?

240


What is the use of cloudera?

228


What is spark client?

189


Can you define data lake?

221


What is the purpose of ‘dump’ keyword in Pig?

363


How can we scale apache mahout in cloud?

35


Does spark store data?

201


What is the need for custom serde?

450


What are the side effects of not running a secondary name node?

282