Answer Posted / Sada Shiv Mishra
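The primary abstraction in Apache Spark is the Resilient Distributed Dataset (RDD): an immutable, partitioned collection of records that is processed in parallel across the cluster and can be recomputed from its lineage if a partition is lost. Spark also provides DataFrames and Datasets, higher-level abstractions that add a schema and a more convenient API for manipulating structured data.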
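To make the "immutable" and "resilient" parts concrete, here is a minimal plain-Python sketch of the idea. It is a toy model, not Spark's API: the class `ToyRDD` and its methods are hypothetical names invented for illustration, and everything runs in a single process rather than on a cluster. It only assumes that a dataset is a set of fixed partitions plus a recorded chain of transformations.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass(frozen=True)  # frozen: fields cannot be reassigned after creation
class ToyRDD:
    """Hypothetical stand-in for an RDD; NOT part of Spark's API."""
    source: Tuple[Tuple[int, ...], ...] = ()   # partitions of the base data
    parent: Optional["ToyRDD"] = None          # lineage: dataset this one derives from
    op: Optional[Callable[[List[int]], List[int]]] = None  # per-partition function

    @staticmethod
    def parallelize(data: List[int], num_partitions: int) -> "ToyRDD":
        # Split the input round-robin into a fixed number of partitions.
        parts = tuple(tuple(data[i::num_partitions]) for i in range(num_partitions))
        return ToyRDD(source=parts)

    def map(self, f: Callable[[int], int]) -> "ToyRDD":
        # A transformation returns a NEW dataset and records lineage; nothing runs yet.
        return ToyRDD(parent=self, op=lambda part: [f(x) for x in part])

    def filter(self, pred: Callable[[int], bool]) -> "ToyRDD":
        return ToyRDD(parent=self, op=lambda part: [x for x in part if pred(x)])

    def num_partitions(self) -> int:
        return len(self.source) if self.parent is None else self.parent.num_partitions()

    def compute_partition(self, i: int) -> List[int]:
        # Any single partition can be rebuilt by replaying the lineage from the source.
        if self.parent is None:
            return list(self.source[i])
        return self.op(self.parent.compute_partition(i))

    def collect(self) -> List[int]:
        # An action: evaluates every partition and gathers the results.
        out: List[int] = []
        for i in range(self.num_partitions()):
            out.extend(self.compute_partition(i))
        return out


base = ToyRDD.parallelize([1, 2, 3, 4, 5, 6], num_partitions=2)
even_squares = base.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

print(even_squares.collect())  # [4, 16, 36]
print(base.collect())          # [1, 3, 5, 2, 4, 6]  (base is unchanged)
```

Each transformation leaves `base` untouched and returns a new object that remembers its parent, so a lost partition can be recomputed with `compute_partition(i)` instead of being restored from a copy. Real Spark RDDs follow the same broad pattern, with scheduling, shuffles, and caching layered on top.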