If there is certain data that we want to use again and again in different transformations, what should improve the performance?
304Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?
346Post New Apache Spark Questions
List the various types of "Cluster Managers" in Spark.
When running Spark applications, is it necessary to install Spark on all the nodes of YARN cluster?
How rdd can be created in spark?
What is spark rdd?
What is the use of flatmap in spark?
Can we broadcast an rdd?
What is difference between scala and spark?
What is an "Accumulator"?
Do I need to know hadoop to learn spark?
Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?
Can you define rdd lineage?
Name a few commonly used spark ecosystems?
Is apache spark part of hadoop?
What are the common transformations in apache spark?
Why is Spark RDD immutable?