A Spark Shuffle is an operation that is performed during tasks in Apache Sp

What is a spark shuffle?

Question Posted / Manik Dubey

1 Answers
349 Views
I also Faced
E-Mail Answers

Answer Posted / Manik Dubey

A Spark Shuffle is an operation that is performed during tasks in Apache Spark to sort data before or after the reduce phase. It involves redistributing data among nodes, causing a significant overhead in terms of network bandwidth and CPU usage. However, it ensures that data is processed in sorted order if required.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

What is meant by Transformation? Give some examples.

328

What is the latest version of spark?

288

Explain how RDDs work with Scala in Spark

355

List the advantage of Parquet file in Apache Spark?

473