Hadoop Interview Questions
Questions Answers Views Company eMail

Discuss the various running mode of Apache Spark?

248

Describe Spark SQL?

266

Explain SparkContext in Apache Spark?

256

What are the types of Transformation in Spark RDD Operations?

239

Explain first() operation in Apache Spark RDD?

280

What are the ways in which Apache Spark handles accumulated Metadata?

310

Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?

243

Is it possible to run Apache Spark without Hadoop?

232

What is Apache Spark Streaming?

265

How can you implement machine learning in Spark?

219

List some commonly used Machine Learning Algorithm Apache Spark?

232

What is the command to start and stop the Spark in an interactive shell?

253

List out the ways of creating RDD in Apache Spark?

235

What are the various advantages of DataFrame over RDD in Apache Spark?

254

What is flatmap in apache spark?

267


Un-Answered Questions { Hadoop }

When and how to create hadoop archive?

319


What are the features of Fully-Distributed mode?

316


Which one is better hadoop or spark?

266


What are the particular functionalities of Nagios in Ambari?

56


Name the components of spark ecosystem.

218


What is identity mapper and chain mapper?

486


What are the site-specific configuration files in Hadoop?

836


What is the process to change the files at arbitrary locations in HDFS?

1232


What combiners are and when you should use a combiner in a mapreduce job?

438


Tell the purpose of Bloom Filter in Cassandra?

133


Mention some important components of cassandra data models?

59


Explain job scheduling through JobTracker

529


What is the future of apache spark?

234


What is org.apache.jute package?

5


Explain the usage of Context Object?

315