Hadoop Interview Questions
Questions Answers Views Company eMail

What is the standalone mode in spark cluster?

213

Explain apache spark streaming? How is the processing of streaming data achieved in apache spark?

252

In what ways sparksession different from sparkcontext?

287

Explain fold() operation in spark?

256

Define sparkcontext in apache spark?

231

List out the various advantages of dataframe over rdd in apache spark?

238

What is map in apache spark?

229

Write the command to start and stop the spark in an interactive shell?

227

Define various running modes of apache spark?

241

What are the ways to run spark over hadoop?

258

What is catalyst query optimizer in apache spark?

244

What are the various types of shared variable in apache spark?

235

Define the common faults of the developer while using apache spark?

250

What is the use of spark driver, where it gets executed on the cluster?

254

What is speculative execution in spark?

280


Un-Answered Questions { Hadoop }

What is the characteristic of streaming API that makes it flexible run MapReduce jobs in languages like Perl, Ruby, Awk etc.?

350


How does apache flume work?

109


Describe the run-time architecture of Spark?

244


What is pig properties?

425


What is hive on spark?

286


What is tunable consistency in Cassandra?

74


Does the hdfs client decide the input split or namenode?

515


What is a databricks cluster?

343


What is difference between map and flatmap in spark?

244


Command to format the NameNode?

768


Can you explain sqoop metastore?

3


What is spark in python?

240


What is apache flume used for?

68


What is dataproc cluster?

240


what is Memtable in Cassandra?

101