What does repartition do in spark?

Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...

What does repartition do in spark?

Question Posted / mahesh singh

1 Answers
347 Views
I also Faced
E-Mail Answers

What does repartition do in spark?..

Answer / Sandeep Shandilya

Repartition in Apache Spark is a function used to change the number of partitions for a DataFrame or RDD. It helps to balance the data distribution across nodes by either increasing or decreasing the number of partitions.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

What is the use of spark?

What are the types of Transformation in Spark RDD Operations?

Describe coalesce() operation. When can you coalesce to a larger number of partitions? Explain.

How to create RDD?

What is cluster mode in spark?

Do I need to learn scala for spark?

What is a "Spark Driver"?

What is spark pipeline?

When should you use spark cache?

What is spark vectorization?

Explain the lookup() operation in Spark?

When creating an RDD, what goes on internally?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)