What is coalesce in spark?
Answer / Sakshi Upadhyay
Coalesce in Spark is an operation that re-partitions a DataFrame or Dataset into a specified number of partitions while ensuring that the total amount of data remains the same. This can help to balance the workload among executors.
| Is This Answer Correct ? | 0 Yes | 0 No |
Can you explain spark core?
Please enumerate the various components of the Spark Ecosystem.
What is spark dynamic allocation?
What is write ahead log(journaling)?
What is scala and spark?
What does apache spark do?
What is spark client?
What happens when you submit spark job?
Define functions of SparkCore?
What are the benefits of Spark lazy evaluation?
What is shuffle spill in spark?
What is Starvation scenario in spark streaming?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)