Answer Posted / Mandeep
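Partition: A partition is a logical chunk of an RDD. Spark splits an RDD into partitions so the data can be processed in parallel; each partition holds a subset of the dataset's records and is processed by a single task, so the number of partitions bounds how much parallelism a stage can use.

Partitioner: A partitioner decides which partition each record of a key-value (pair) RDD is placed in. Spark ships with built-in partitioners (HashPartitioner and RangePartitioner), and you can define your own by extending the Partitioner class. A partitioner exposes the number of partitions (numPartitions) and a getPartition(key) method that maps a key to a partition index from 0 to numPartitions - 1. It does not have to be hash-based: HashPartitioner uses the key's hash code, while RangePartitioner assigns keys by sorted ranges.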
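To make the relationship concrete, here is a minimal plain-Python sketch of what a hash-based partitioner does. It does not use Spark at all: the class HashPartitionerSketch and the helper partition_by are hypothetical names made up for this illustration, and crc32 is an assumption chosen only because it gives the same result on every run (Spark's Scala HashPartitioner works from the key's hashCode instead).

```python
import zlib


class HashPartitionerSketch:
    """Hypothetical stand-in for a hash-based partitioner (not Spark's class)."""

    def __init__(self, num_partitions):
        if num_partitions <= 0:
            raise ValueError("num_partitions must be positive")
        self.num_partitions = num_partitions

    def get_partition(self, key):
        # Map a key to a partition index in [0, num_partitions).
        # crc32 is an illustrative choice because it is stable across runs.
        return zlib.crc32(repr(key).encode("utf-8")) % self.num_partitions


def partition_by(records, partitioner):
    """Distribute (key, value) records into a list of partitions."""
    partitions = [[] for _ in range(partitioner.num_partitions)]
    for key, value in records:
        partitions[partitioner.get_partition(key)].append((key, value))
    return partitions


records = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
partitions = partition_by(records, HashPartitionerSketch(3))
for index, part in enumerate(partitions):
    print(index, part)
```

The point to notice is that every record with the same key lands in the same partition, which is what lets key-based operations such as grouping or joining work on one partition at a time. In Spark itself the equivalent step is calling partitionBy on a pair RDD with a partitioner.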