Answer Posted / Kum.anjali Singh
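An RDD (Resilient Distributed Dataset) is the fundamental data structure in Apache Spark, and PySpark exposes it to Python. It represents an immutable, partitioned collection of objects that can be processed in parallel across the nodes of a cluster. Because an RDD cannot be modified in place, every transformation (such as map or filter) produces a new RDD.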
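To make the idea concrete, here is a minimal sketch in plain Python. It is a toy model, not the Spark API: the class name ToyRDD and its methods are hypothetical and only imitate the shape of the real PySpark calls (SparkContext.parallelize, RDD.map, RDD.collect). It also assumes threads on one machine stand in for cluster nodes, and it runs each transformation eagerly, whereas Spark evaluates transformations lazily.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyRDD:
    """Toy stand-in for an RDD: an immutable collection split into partitions."""

    def __init__(self, partitions):
        # Tuples keep the stored data read-only after construction.
        self._partitions = tuple(tuple(p) for p in partitions)

    @classmethod
    def parallelize(cls, data, num_partitions=2):
        # Split the input into contiguous chunks, one chunk per partition.
        items = list(data)
        size = max(1, -(-len(items) // num_partitions))  # ceiling division
        return cls(items[i:i + size] for i in range(0, len(items), size))

    def map(self, fn):
        # A transformation returns a new ToyRDD; the original is left untouched.
        # Each partition is handled by its own task (threads here, nodes in Spark).
        with ThreadPoolExecutor() as pool:
            mapped = pool.map(lambda part: [fn(x) for x in part], self._partitions)
            return ToyRDD(mapped)

    def collect(self):
        # Gather all partitions back into a single list, preserving order.
        return [x for part in self._partitions for x in part]


numbers = ToyRDD.parallelize(range(1, 7), num_partitions=3)
squares = numbers.map(lambda x: x * x)

print(numbers.collect())  # [1, 2, 3, 4, 5, 6]  (unchanged by the map)
print(squares.collect())  # [1, 4, 9, 16, 25, 36]
```

In real PySpark the equivalent steps would be roughly sc.parallelize(range(1, 7), 3).map(lambda x: x * x).collect(), where sc is an existing SparkContext.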