Hadoop Interview Questions
Questions Answers Views Company eMail

What is the use of SparkContext?

271

How is the processing of streaming data achieved in Apache Spark? Explain.

249

Can you do real-time processing with Spark SQL?

273

Discuss the role of the Spark driver in a Spark application.

245

What are the features of RDD that make it an important abstraction of Spark?

229

What is Apache Spark? What is the reason behind the evolution of this framework?

236

What are accumulators in Apache Spark?

269

Why are transformations lazy operations on Apache Spark RDDs? How is this useful?

343

Explain the different types of transformations on DStreams.

259

Describe the run-time architecture of Spark.

244

What is the flatMap transformation in Apache Spark RDD?

248

Can you run Apache Spark on Apache Mesos?

267

Describe Partition and Partitioner in Apache Spark.

262

Describe accumulators in Apache Spark in detail.

273

List the languages supported by Apache Spark.

233


Unanswered Questions (Hadoop)

What is the difference between an RDBMS and Hadoop MapReduce?

658


Define HRegionServer in HBase.

141


What mechanisms are available for connecting from applications when Hive runs as a server?

596


Differentiate between Pig Latin and HiveQL.

421


What are the main components of Hadoop?

302


How does the NameNode handle DataNode failures in HDFS?

30


What is pseudo-distributed mode?

470


What is the function of HMaster?

180


What are the identity mapper and identity reducer? In which cases can we use them?

869


What makes Apache Spark good at low-latency workloads like graph processing and machine learning?

277


Does Spark run Hadoop?

255


Which companies are mostly using Hive?

612


Is Apache Spark going to replace Hadoop?

297


Which command do we use to run HBase Shell?

178


What is standalone mode in Spark?

297