What is the contrast between RDD, DataFrame and DataSets?
Answer Posted / Jabir Husain
RDD (Resilient Distributed Datasets) is Spark's fundamental data structure, providing distributed collections of immutable data. DataFrames are a higher-level abstraction built on top of RDDs, offering support for SQL queries and easy manipulation with Schema objects. DataSets are Scala APIs equivalent to PySpark's DataFrames.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers