Big Data Interview Questions
Questions Views

What is Apache Spark for beginners?

193

What is deploy mode in Spark?

208

What is a pair RDD?

208

What is a data pipeline in Spark?

203

What is a Spark RDD?

221

What are the optimization techniques in Spark?

184

Can you run Spark on Windows?

199

Why is Spark good?

197

Do I need to know Hadoop to learn Spark?

204

Is there a distributed machine learning framework on top of Spark?

194

How does Hadoop achieve fault tolerance?

218

Is Hadoop still in demand?

220

What is winutils in Hadoop?

237

Is Hive a NoSQL database?

379

Is Hive similar to SQL?

425


Un-Answered Questions { Big Data }

What does it indicate if a replica stays out of the ISR for a long time?

457


What are the commonalities between Pig and Hive?

348


When to choose "External Table" in Hive?

401


Can copper cause a spark?

183


Is HBase an OS-independent approach?

127


Why do we need SparkContext?

204


How do you specify more than one directory as input in a Hadoop MapReduce program?

403


What is the use of the create-hive-table command in Hadoop Sqoop?

5


What do you mean by the worker node?

213


Can we run Unix shell commands from Hive? Give an example.

415


What are the management tools in Cassandra?

61


How is file splitting invoked in Hadoop?

266


What is the default port of Presto?

5


Explain the current Hadoop configuration files.

367


Can we do online transactions (OLTP) using Hadoop?

378