Answer Posted / Israr Ahmad
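Partitions are the logical chunks into which an RDD (Resilient Distributed Dataset) is divided so that it can be processed in parallel. Each partition holds a portion of the data and is processed by a single task. Apache Spark schedules these tasks on executors running on the cluster's worker nodes, so one worker node may process several partitions rather than exactly one.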
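To make the idea concrete, here is a minimal sketch in plain Python. It is not Spark code, only an analogy: the helper names (split_into_partitions, process_partition, parallel_sum) are hypothetical, and a thread pool stands in for Spark's executors. The data is split into partitions, each partition is handled by one task, and the partial results are combined.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(data, num_partitions):
    """Split a list into num_partitions contiguous chunks of near-equal size."""
    size, extra = divmod(len(data), num_partitions)
    partitions = []
    start = 0
    for i in range(num_partitions):
        # The first `extra` partitions receive one additional element.
        end = start + size + (1 if i < extra else 0)
        partitions.append(data[start:end])
        start = end
    return partitions


def process_partition(partition):
    """One 'task': works only on the data in its own partition."""
    return sum(partition)


def parallel_sum(data, num_partitions, num_workers=2):
    """Sum data by processing each partition as a separate task."""
    partitions = split_into_partitions(data, num_partitions)
    # Fewer workers than partitions is fine: each worker handles several tasks.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partial_sums = list(pool.map(process_partition, partitions))
    return sum(partial_sums)


if __name__ == "__main__":
    numbers = list(range(10))
    print(split_into_partitions(numbers, 4))
    print(parallel_sum(numbers, 4))
```

In PySpark itself, the corresponding steps would be sc.parallelize(data, numSlices) to create an RDD with a chosen number of partitions, rdd.getNumPartitions() to check the count, and rdd.glom().collect() to inspect the contents of each partition.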