Answer Posted / Upendra Yadav
RDD (Resilient Distributed Dataset) is an immutable distributed collection of data that can be operated on in parallel. It serves as the fundamental data structure in Apache Spark for large-scale data processing. RDDs are fault-tolerant, meaning they can automatically recover from node failures and maintain consistency across the cluster.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers