Big Data Interview Questions
Questions Answers Views Company eMail

List some commonly used Machine Learning Algorithm Apache Spark?

186

What is the command to start and stop the Spark in an interactive shell?

207

List out the ways of creating RDD in Apache Spark?

192

What are the various advantages of DataFrame over RDD in Apache Spark?

193

What is flatmap in apache spark?

205

What is the standalone mode in spark cluster?

164

Explain apache spark streaming? How is the processing of streaming data achieved in apache spark?

190

In what ways sparksession different from sparkcontext?

238

Explain fold() operation in spark?

200

Define sparkcontext in apache spark?

190

List out the various advantages of dataframe over rdd in apache spark?

193

What is map in apache spark?

184

Write the command to start and stop the spark in an interactive shell?

187

Define various running modes of apache spark?

190

What are the ways to run spark over hadoop?

182


Un-Answered Questions { Big Data }

Explain what is Cassandra-Cqlsh?

74


Is spark part of hadoop ecosystem?

200


What is Reducer in Hadoop?

250


Define the level of parallelism and its need in spark streaming?

233


What are some alternatives to apache kafka?

286






What happen if number of reducer is set to 0 in Hadoop?

241


Define SSTable?

52


What types of costs are associated in creating index on hive tables?

497


What are the different features of Cassandra?

59


Can we submit the mapreduce job from slave node?

840


What is the future of Big Data trend?

313


What is the meaning of the term "non-DFS used" in Hadoop web-console?

940


What is application master in spark?

181


Define NoSQL Database?

68


What is the difference between a Hadoop and Relational Database and Nosql?

716