Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


What is difference between coalesce and repartition?



What is difference between coalesce and repartition?..

Answer / Amit Kumar Singh

Coalesce operation in Apache Spark is used to reduce the number of partitions for a DataFrame or RDD, while maintaining their existing order. It combines the contiguous partitions. On the other hand, repartition operation is used to change the total number of partitions for a DataFrame or RDD, which might cause the shuffle of data, and can affect the performance due to increased network traffic.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

What is RDD?

1 Answers  


Which the fundamental data structure of Spark

1 Answers  


Does spark run hadoop?

1 Answers  


What are the ways in which Apache Spark handles accumulated Metadata?

1 Answers  


What file systems does spark support?

1 Answers  


How does broadcast join work in spark?

1 Answers  


Are sparks dangerous?

1 Answers  


How can you trigger automatic clean-ups in Spark to handle accumulated metadata?

1 Answers  


What are the optimization techniques in spark?

1 Answers  


Why is Transformation lazy in Spark?

1 Answers  


What exactly is apache spark?

1 Answers  


By Default, how many partitions are created in RDD in Apache Spark?

1 Answers  


Categories