Answer Posted / Shweta Kashyap
"Apache Spark processes large datasets by representing them as Resilient Distributed Datasets (RDDs), which are split into partitions and distributed across a cluster of machines. Each machine runs tasks that operate on its assigned partitions in parallel, and the partial results are aggregated to produce the final output. Spark also provides high-level APIs for common big data workloads such as SQL, streaming, and machine learning."
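In Spark's own API, this pattern is roughly `sc.parallelize(data).map(f).reduce(g)`. As a toy illustration of the partition-and-aggregate idea the answer describes (plain Python, no Spark installation required; `partition` and `run_job` are hypothetical helper names, not Spark APIs):

```python
from functools import reduce

def partition(data, n):
    """Split data into up to n roughly equal chunks (analogous to RDD partitions)."""
    size = (len(data) + n - 1) // n
    return [data[i:i + size] for i in range(0, len(data), size)]

def run_job(data, map_fn, reduce_fn, num_partitions=4):
    """Map each partition independently, then aggregate the partial results.

    In real Spark, each partition would be processed by a task on a
    cluster machine; here the partitions are processed sequentially.
    """
    partials = [reduce(reduce_fn, map(map_fn, part))
                for part in partition(data, num_partitions)]
    return reduce(reduce_fn, partials)

# Example: sum of squares of 1..100, computed partition by partition.
total = run_job(range(1, 101), lambda x: x * x, lambda a, b: a + b)
print(total)  # 338350
```

The key property the sketch shares with Spark is that each partition's work is independent, so the per-partition step could run on separate machines before the final aggregation.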