Answer Posted / Ankush Panwar
Spark processes data as resilient distributed datasets (RDDs), immutable collections of records that are stored and manipulated in parallel across a Spark cluster. Each RDD is divided into partitions, and at execution time Spark launches one task per partition on the worker nodes. Spark also records each RDD's lineage, the chain of transformations that produced it, which lets it recompute lost partitions after a failure and optimize how computations are scheduled.