Answer Posted / Pankaj Singh
Broadcast join is a technique used in Apache Spark to join a large table with a smaller one. In this approach, the larger table (known as the broadcaster) is distributed across all nodes in the cluster, while the smaller table (known as the joiner) remains on each executor. The broadcast table is replicated once on each worker, and then the join operation takes place locally on each node, reducing network communication and improving performance.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers