Apache Spark Interview Questions
Questions Answers Views Company eMail

What is difference between cache and persist in spark?

224

Is bigger than spark driver maxresultsize?

252

Does spark use java?

279

How do you process big data with spark?

225

What is a spark shuffle?

277

Why do we need apache spark?

225

How do I optimize my spark code?

248

What is the difference between client mode and cluster mode in spark?

255

What are transformations in spark?

252

What is driver and executor in spark?

222

Is spark secure?

235

What is executor memory and driver memory in spark?

236

What is the point of apache spark?

230

What is rdd map?

273

What is faster than apache spark?

240


Post New Apache Spark Questions

Un-Answered Questions { Apache Spark }

What is sc parallelize in spark?

245


Write the command to start and stop the spark in an interactive shell?

225


What is the difference between spark ml and spark mllib?

248


Explain Machine Learning library in Spark?

229


What is shuffle spill in spark?

239


What is cluster in apache spark?

270


Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?

255


What is the standalone mode in spark cluster?

205


Explain the use of broadcast variables

269


Is spark good for machine learning?

256


What is spark catalyst?

258


What are spark stages?

249


What is a "Parquet" in Spark?

252


Explain Spark coalesce() operation?

245


List some use cases where Spark outperforms Hadoop in processing.

237