The primary abstraction in Apache Spark is the Resilient Distributed Datase

What are the abstractions of Apache Spark?

Question Posted / Sada Shiv Mishra

1 Answers
319 Views
I also Faced
E-Mail Answers

Answer Posted / Sada Shiv Mishra

The primary abstraction in Apache Spark is the Resilient Distributed Dataset (RDD), which is an immutable distributed collection of data. Other abstractions include DataFrames and Datasets, which provide a more convenient API for manipulating structured data.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

List the advantage of Parquet file in Apache Spark?

474

Explain how RDDs work with Scala in Spark

355

What is meant by Transformation? Give some examples.

328

What is the latest version of spark?

288