Answer Posted / Siddharth Gautam
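The coalesce operation in Apache Spark reduces the number of partitions in an RDD, DataFrame, or Dataset. It merges existing partitions into fewer ones without a full shuffle, so it is usually much cheaper than repartition, but the resulting partitions can be uneven in size. It is useful when you have too many small partitions, for example after a filter that removes most rows or just before writing output files. If you need to increase the number of partitions or spread the data evenly across workers, use repartition instead.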
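To make the "merge without shuffle" idea concrete, here is a small plain-Python sketch. It is only a toy model, not Spark's actual implementation: the function name coalesce_partitions and the contiguous-grouping rule are assumptions made for illustration (real Spark also takes data locality into account). In Spark itself the equivalent call is simply df.coalesce(n).

```python
from typing import List


def coalesce_partitions(partitions: List[list], num_partitions: int) -> List[list]:
    """Toy model of coalesce: merge whole, contiguous partitions.

    Hypothetical helper for illustration only. Records are never
    redistributed individually, which is why no shuffle is needed
    and why the merged partitions can end up uneven.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")

    current = len(partitions)
    # Like DataFrame.coalesce, asking for more partitions than exist
    # leaves the partition count unchanged.
    if num_partitions >= current:
        return [list(p) for p in partitions]

    merged = []
    for i in range(num_partitions):
        # Assign a contiguous range of input partitions to output partition i.
        start = i * current // num_partitions
        end = (i + 1) * current // num_partitions
        group = []
        for part in partitions[start:end]:
            group.extend(part)
        merged.append(group)
    return merged


# Six partitions, the first one much larger than the rest.
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9], [10], [11]]
result = coalesce_partitions(parts, 2)
sizes = [len(p) for p in result]
print(sizes)  # [8, 3]
```

The two output partitions hold 8 and 3 records, which shows why coalesce is cheap but does not balance load; repartition would shuffle the records to even them out.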