Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


What is the difference between coalesce and repartition in spark?



What is the difference between coalesce and repartition in spark?..

Answer / Prem Shankar Jha

Coalesce and Repartition are operations used to change the number of partitions of a DataFrame or RDD. Repartition re-distributes data across new partitions, while Coalesce consolidates existing partitions (and potentially reducing the overall partition count) by combining smaller ones into larger ones. The main difference is that Repartition may shuffle the data more than Coalesce as it needs to redistribute the data among different partitions.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

Does spark sql use hive?

1 Answers  


What are the drawbacks of Apache Spark?

1 Answers  


Explain about mappartitions() and mappartitionswithindex()

1 Answers  


What is standalone mode in spark?

1 Answers  


What are the benefits of lazy evaluation?

1 Answers  


What is the standalone mode in spark cluster?

1 Answers  


What is action, how it process data in apache spark

1 Answers  


What are the limitations of Spark?

1 Answers  


What is lambda in spark?

1 Answers  


What is the difference between rdd and dataframe?

1 Answers  


What is partitioner spark?

1 Answers  


How do you parse data in xml? Which kind of class do you use with java to pass data?

1 Answers  


Categories