Big Data Interview Questions
Questions Answers Views Company eMail

How many ways we can create rdd?

198

What does repartition do in spark?

197

What is the driver program in spark?

183

What is spark submit?

188

How do I clear my spark cache?

178

What is a partition in spark?

213

What is spark vectorization?

188

What is off heap memory in spark?

182

What is a tuple in spark?

195

Is spark an etl?

190

How is rdd distributed?

199

What are the common transformations in apache spark?

188

What is the difference between dataset and dataframe in spark?

221

What is distributed cache in spark?

201

What is catalyst framework in spark?

191


Un-Answered Questions { Big Data }

What is the significance of the line set hive.mapred.mode = strict;?

488


What if a namenode has no data?

411


Explain the Avro SASL Profile?

65


Can you tell us how many daemon processes run on a hadoop system?

342


List of some best tools that can be useful for data-analysis?

239






How can we look for the namenode in the browser?

401


Name three features of using Apache Spark

193


How we can check hadoop sqoop installed or not in a system?

5


What is the local repository and where it is useful while using ambari environment?

48


How businesses could be benefitted with Big Data?

267


Can Apache Kafka be used without Zookeeper?

671


Explain tobag function?

320


What is MapFile?

647


Explain about Hadoop file system and processing framework?

243


How mahout used with python ?

91