Hadoop Interview Questions
Questions Answers Views Company eMail

What is the driver program in spark?

185

What is spark submit?

190

How do I clear my spark cache?

180

What is a partition in spark?

213

What is spark vectorization?

188

What is off heap memory in spark?

184

What is a tuple in spark?

197

Is spark an etl?

190

How is rdd distributed?

201

What are the common transformations in apache spark?

188

What is the difference between dataset and dataframe in spark?

221

What is distributed cache in spark?

203

What is catalyst framework in spark?

191

How is dag created in spark?

190

What does spark do during speculative execution?

201


Un-Answered Questions { Hadoop }

Why do we need apache spark?

191


Where sorting is done in Hadoop MapReduce Job?

393


Is Hive useful when making data warehouse applications?

453


Does spark use mapreduce?

186


What are the differences between PIG and MapReduce?

355






Does impala support generic jdbc?

35


What is NoSQL?

648


What do you understand by Lazy Evaluation?

209


What is mlib in apache spark?

190


Define ttl in hbase?

122


What are consumers or users?

362


What is kerberos secured cluster in apache pig?

393


What is Cassandra-Cqlsh?

66


Explain Spark Executor

192


How do I download spark?

185