Big Data Interview Questions
Questions Answers Views Company eMail

What are shared variables in Spark?

213

What is the future of Apache Spark?

193

How can I improve my Spark performance?

188

What is the Apache Spark architecture?

216

Why is Spark faster than Hive?

188

What happens if an RDD partition is lost due to worker node failure?

306

What is a pair RDD in Spark?

200

What is the difference between cache and persist in Spark?

192

Is bigger than spark.driver.maxResultSize?

217

Does Spark use Java?

201

How do you process big data with Spark?

179

What is a Spark shuffle?

210

Why do we need Apache Spark?

191

How do I optimize my Spark code?

199

What is the difference between client mode and cluster mode in Spark?

205


Un-Answered Questions { Big Data }

For a Hadoop job, how will you write a custom partitioner?

403


Explain Apache Kafka.

307


How do you use the CREATE DATABASE statement in Apache Tajo?

5


What is the design architecture of Cassandra?

55


What are the issues associated with the map and reduce slot-based mechanism in MapReduce?

390






How many types of NoSQL databases are there? Give some examples.

154


What is the task of the Spark engine?

234


What do you understand by snitches?

49


What does the Connector API do in Kafka?

292


What is UDF in Pig?

407


What is speculative execution in Hadoop MapReduce?

375


Explain the fold() operation in Spark.

202


What is the default replication factor in HDFS?

682


What is the salary of a Hadoop developer?

392


How do ‘map’ and ‘reduce’ work?

356