Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


If there is certain data that we want to use again and again in different transformations, what should improve the performance?



If there is certain data that we want to use again and again in different transformations, what shou..

Answer / Mahima Singh

"Caching the DataFrame or DataSet can significantly improve the performance as it stores the data in memory for reuse across multiple actions. Another approach could be using persist() method with MEMORY_ONLY, MEMORY_ONLY_SER, or MEMORY_AND_DISK storage levels depending on the required level of persistence and memory footprint."

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

What is client mode in spark?

1 Answers  


What are the ways to run spark over hadoop?

1 Answers  


Can you define pagerank?

1 Answers  


Different Running Modes of Apache Spark

1 Answers  


Describe the run-time architecture of Spark?

1 Answers  


Can you explain accumulators in apache spark?

1 Answers  


How does spark rdd work?

1 Answers  


What is shuffle in spark?

1 Answers  


What is a dstream in apache spark?

1 Answers  


What is the biggest shortcoming of Spark?

1 Answers  


How is rdd fault?

1 Answers  


Explain briefly what is Action in Apache Spark? How is final result generated using an action?

1 Answers  


Categories