What is rdd lineage graph? How is it useful in achieving fault tolerance?
Answer Posted / Harish Agarwal
"RDD Lineage Graph represents the history of RDD transformations and actions, showing how each RDD was derived from its parent RDDs. It helps Spark to recompute any lost data when a failure occurs by reproducing the necessary operations. This enables fault tolerance and consistent results in case of data loss or node failures."
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers