Hadoop Interview Questions
Questions Answers Views Company eMail

Discuss the various running modes of Apache Spark?

200

Describe Spark SQL?

222

Explain SparkContext in Apache Spark?

216

What are the types of transformations in Spark RDD operations?

196

Explain first() operation in Apache Spark RDD?

241

What are the ways in which Apache Spark handles accumulated metadata?

256

Explain the various transformations on an Apache Spark RDD, such as distinct(), union(), intersection(), and subtract()?

199

Is it possible to run Apache Spark without Hadoop?

192

What is Apache Spark Streaming?

204

How can you implement machine learning in Spark?

181

List some commonly used machine learning algorithms in Apache Spark?

186

What are the commands to start and stop Spark in an interactive shell?

207

List the ways of creating an RDD in Apache Spark?

194

What are the advantages of DataFrames over RDDs in Apache Spark?

193

What is flatMap in Apache Spark?

205


Un-Answered Questions { Hadoop }

What happens when a DataNode fails during the write process?

380


When would you use HBase?

142


Is Spark faster than Hadoop?

200


How do you open a connection in HBase?

106


Why is lazy evaluation good in Spark?

191






What are the different types of partitioners in Cassandra? Explain.

51


What is Cassandra used for?

42


How can we create RDDs in Apache Spark?

192


What is the HDFS block size, and what did you choose in your project?

620


If you run a SELECT * query in Hive, why does it not run MapReduce?

492


Explain the major libraries that constitute the Spark ecosystem?

255


Is it possible to add or delete column families in a working group?

41


Name the ports Cassandra uses?

53


What will be the result when you do cast('abc' as int)?

468


What happens when the node running the map task fails before the map output has been sent to the reducer?

375