Answer Posted / Satyendra Kumar Tiwari
RDD (Resilient Distributed Dataset) is a fundamental data structure in Apache Spark that represents an immutable distributed collection of objects. DataFrames, on the other hand, provide a programming interface for manipulating structured data (such as tables with columns and rows), including support for SQL-like queries and more advanced data types beyond primitives like integers and strings.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers