Big Data Interview Questions
Questions Answers Views Company eMail

What is Apache Spark for beginners?

191

What is deploy mode in Spark?

208

What is a pair RDD?

208

What is a data pipeline in Spark?

203

What is a Spark RDD?

221

What are the optimization techniques in Spark?

182

Can you run Spark on Windows?

197

Why is Spark good?

195

Do I need to know Hadoop to learn Spark?

204

Which distributed machine learning framework runs on top of Spark?

192

How does Hadoop achieve fault tolerance?

218

Is Hadoop still in demand?

220

What is winutils in Hadoop?

237

Is Hive a NoSQL database?

377

Is Hive similar to SQL?

424


Unanswered Questions { Big Data }

What are the parameters used to create a keyspace in Cassandra?

46


What do the master class and the output class do?

394


How can you compare Hadoop and Spark in terms of ease of use?

196


How do you call Impala built-in functions?

43


What are the use cases of Apache Pig?

500


List the functions of Spark SQL.

377


Explain how you can change a column data type in Hive.

445


What is a shuffle block in Spark?

182


How many JVMs run on a slave node?

647


Explain how MapReduce works.

361


How did you debug your Hadoop code?

998


Does Hadoop require RAID?

653


What is the difference between a leader and a follower in Kafka?

318


Does Google use Hadoop?

385


Explain SparkContext in Apache Spark.

216