Hadoop Interview Questions
Questions Answers Views Company eMail

What is heap memory in spark?

186

What is external shuffle service in spark?

210

What is spark client?

196

Which are the various data sources available in spark sql?

200

Can you run spark without hadoop?

218

What is tungsten engine in spark?

224

What is stage and task in spark?

151

What is spark execution engine?

206

Is spark sql a database?

184

Is spark distributed computing?

218

What is row rdd in spark?

201

How is rdd fault?

206

How does reducebykey work in spark?

183

What is apache spark for beginners?

199

What is deploy mode in spark?

214


Un-Answered Questions { Hadoop }

Can multiple clients write into a Hadoop HDFS file concurrently?

33


When we send a data to a node, do we allow settling in time, before sending another data to that node?

279


What is partitioner spark?

193


Can a partition be archived? What are the advantages and Disadvantages?

487


Elaborate on cassandra - cql?

50






What is Cassandra Data Modelling ?

59


Is Pig script case sensitive?

331


Is map like a pointer?

681


Is spark based on hadoop?

205


What is pair rdd in spark?

207


How do ‘map’ and ‘reduce’ work?

360


Why is cqlsh used?

108


Differentiate between drop and truncate in cqlsh

53


Explain the default level of parallelism in Apache Spark

232


How do you define a partitioning key?

323