What are the differences between Caching and Persistence method in Apache Spark?
Answer Posted / Rjeev Saxena
Caching and Persistence are methods used to keep data in memory for faster access. However, they differ in their approach and use case.n1) Caching: It is a default persistence method used in Spark which stores RDDs in memory of executors. Data is only cached when an action is triggered.n2) Persistence: This allows users to persist data across actions using the `persist()` or `checkpoint()` functions. Unlike caching, persisted RDDs can survive shuffle operations.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers