What is ancestry in Spark? How adaptation to internal failure is accomplished in Spark utilizing Lineage Graph?
Answer Posted / Udayraj Singh
"In Apache Spark, lineage refers to the data processing history of a dataset. The lineage graph helps Spark track the operations performed on datasets and enables it to recompute results if necessary when a task fails. Ancestry is a part of this lineage that represents the parents (or input) of a dataset."
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers