Do I need to know Scala to learn Spark?
What is the difference between caching and persistence in Apache Spark?
What are the types of Apache Spark transformations?
What is a Spark shuffle?
Explain the different types of transformations on DStreams.
Where is Apache Spark used?
How is an RDD distributed?
Is Apache Spark a good fit for reinforcement learning?
What is a Spark RDD?
How does a broadcast join work in Spark?
What is MLlib in Apache Spark?
Discuss the role of the Spark driver in a Spark application.
Describe the coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
What is the bottom layer of abstraction in the Spark Streaming API?
How is Spark SQL different from HQL and SQL?