Explain how you can minimize data transfers when working with Spark.
Answer / Amit Katiyar
Data transfers in Apache Spark are dominated by shuffles, so minimizing them comes down to a few techniques: use broadcast variables to ship a small, read-only dataset to every executor once, instead of serializing it with every task; prefer a broadcast (map-side) join over a shuffle-based sort-merge join when one side of the join is small enough to fit in executor memory; avoid shuffle-heavy operations such as groupByKey in favor of reduceByKey or aggregateByKey, which combine values on the map side before anything crosses the network; use coalesce() rather than repartition() to reduce the number of partitions, since coalesce() avoids a full shuffle; and cache or persist RDDs and DataFrames that are reused by multiple actions so they are not recomputed from scratch.