Answer Posted / Ankur Jain
The Directed Acyclic Graph (DAG) is crucial in Apache Spark as it allows efficient scheduling and execution of tasks. Each edge in the graph represents a dependency between two operations, with the source operation being computed before the target operation. By analyzing the dependencies in the DAG, Spark can optimize the order of task execution to minimize data shuffling and improve performance.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers