Explain the terms Spark Partitions and Partitioners?

Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...

Explain the terms Spark Partitions and Partitioners?

Question Posted / arti pal

1 Answers
314 Views
I also Faced
E-Mail Answers

Explain the terms Spark Partitions and Partitioners?..

Answer / Sheshmani Arya

In Spark, a partition is a logical division of data, used to manage parallelism. Each partition contains a subset of the original data. A Partitioner is an algorithm that determines how to split data into partitions.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

What is spark dynamic allocation?

What is lineage graph?

What is Spark Dataset?

How rdd can be created in spark?

What is the command to start and stop the Spark in an interactive shell?

How can we create RDD in Apache Spark?

What is data skew in spark?

What is map in spark?

What is spark pipeline?

What is difference between rdd and dataframe?

What is shuffle spill in spark?

What is the default partition in spark?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)