Name the two types of shared variable available in Apache Spark?
explain the use of blinkdb?
What is shark?
What is difference between hadoop and spark?
How does yarn work with spark?
How is dag created in spark?
Explain the lookup() operation in Spark?
Explain Spark join() operation?
What is a dataset? What are its advantages over dataframe and rdd?
List the languages supported by Apache Spark?
What are Actions? Give some examples.
List some use cases where Spark outperforms Hadoop in processing.
What is a "Parquet" in Spark?
How is Apache Spark better than Hadoop?
What are the optimization techniques in spark?