Hadoop Interview Questions
Questions Answers Views Company eMail

Discuss the various running mode of Apache Spark?

214

Describe Spark SQL?

228

Explain SparkContext in Apache Spark?

224

What are the types of Transformation in Spark RDD Operations?

204

Explain first() operation in Apache Spark RDD?

245

What are the ways in which Apache Spark handles accumulated Metadata?

264

Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?

203

Is it possible to run Apache Spark without Hadoop?

200

What is Apache Spark Streaming?

206

How can you implement machine learning in Spark?

184

List some commonly used Machine Learning Algorithm Apache Spark?

194

What is the command to start and stop the Spark in an interactive shell?

215

List out the ways of creating RDD in Apache Spark?

200

What are the various advantages of DataFrame over RDD in Apache Spark?

199

What is flatmap in apache spark?

209


Un-Answered Questions { Hadoop }

Can hadoop handle streaming data?

268


Explain Sort Order in brief?

162


Can Apache Kafka be used without Zookeeper?

681


Why does my select statement fail?

41


How data or file is read in Hadoop HDFS?

24






Which are the various data sources available in spark sql?

199


Is it necessary to install spark on all the nodes of a YARN cluster while running Apache Spark on YARN ?

238


Can you explain combiner?

246


What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?

794


Can you explain data versioning?

124


Define compaction in HBase?

124


How to keep HDFS cluster balanced?

809


What is hbase fsck?

155


Explain what is sequencefileinputformat?

249


Explian the Limitations of HBase?

133