Topic :: Apache Spark





Apache Spark Interview Questions
Questions Answers Views Company eMail

Name three companies which is used Spark Streaming services

254

Explain the default level of parallelism in Apache Spark

251

Explain the process to trigger automatic clean-up in Spark to manage accumulated metadata.

230

Can You Use Apache Spark To Analyze and Access Data Stored In Cassandra Databases?

224

What is Spark SQL?

236

Can you explain how to minimize data transfers while working with Spark?

446

What are the ways to launch Apache Spark over YARN?

211

is it necessary to install Spark on all nodes while running Spark application on Yarn?

223

What is a worker node in Apache Spark?

223

What is worker node in Apache Spark cluster?

225

What is action, how it process data in apache spark

252

What is sparkContext?

221

Name various types of Cluster Managers in Spark.

239

How much faster is Apache spark than Hadoop?

226

Difference between groupByKey vs reduceByKey in Apache Spark?

348




Un-Answered Questions { Apache Spark }

What are Apache Spark, Flume, Lucene, Hama, HCatalog, Mahout, Drill, Crunch and Thrift?

385


Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?

321


Does Apache Spark provide check pointing?

335


Explain about the popular use cases of Apache Spark

364


Why is Apache Spark faster than Apache Hadoop?

495






Compare Apache Hadoop and Apache Spark?

272


What is Apache Spark?

218


explain the key features of Apache Spark?

230


How is Apache Spark better than Hadoop?

241


Explain the term paired RDD in Apache Spark?

279


Which all languages Apache Spark supports?

263


explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.

326


What are the types of Apache Spark transformation?

219


Why Apache Spark?

240


Explain transformation and action in RDD in Apache Spark?

219