How is RDD in Apache Spark different from Distributed Storage Management?
Answer Posted / Himanchal
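An RDD (Resilient Distributed Dataset) is Spark's core data abstraction: an immutable, partitioned collection of records that is processed in parallel across a cluster and can be cached in memory for reuse. It is "resilient" because Spark records the lineage of transformations that produced each partition and recomputes a lost partition from that lineage. Distributed storage management, by contrast, is the work of persisting and organizing data across multiple machines, which is what systems such as HDFS do. An RDD is therefore a computation-side abstraction that sits on top of distributed storage systems and gives a single programming interface over many data sources; it is not a storage system itself.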
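To make the layering concrete, here is a minimal toy sketch in plain Python. It is not the Spark API: `BlockStore`, `ToyRDD`, `from_store` and `evict` are hypothetical names invented for illustration. The storage layer only holds blocks, while the RDD-like object describes how to compute each partition from those blocks, can keep results in memory, and rebuilds a dropped partition by re-running its computation.

```python
from typing import Callable, Dict, List, Optional


class BlockStore:
    """Hypothetical stand-in for a distributed storage layer: it only holds named blocks."""

    def __init__(self) -> None:
        self.blocks: Dict[str, List[int]] = {}
        self.reads = 0  # number of block fetches, to make storage access visible

    def put(self, block_id: str, records: List[int]) -> None:
        self.blocks[block_id] = list(records)

    def get(self, block_id: str) -> List[int]:
        self.reads += 1
        return list(self.blocks[block_id])


class ToyRDD:
    """Toy model of an RDD: partitions are defined by how to compute them, not by stored data."""

    def __init__(self, num_partitions: int, compute: Callable[[int], List[int]]) -> None:
        self.num_partitions = num_partitions
        self._compute = compute  # the "lineage": recipe for partition i
        self._cache: Optional[Dict[int, List[int]]] = None

    @classmethod
    def from_store(cls, store: BlockStore, block_ids: List[str]) -> "ToyRDD":
        # One partition per storage block.
        return cls(len(block_ids), lambda i: store.get(block_ids[i]))

    def map(self, fn: Callable[[int], int]) -> "ToyRDD":
        # A transformation returns a new dataset; the parent is left unchanged.
        return ToyRDD(self.num_partitions, lambda i: [fn(x) for x in self.partition(i)])

    def cache(self) -> "ToyRDD":
        self._cache = {}
        return self

    def partition(self, i: int) -> List[int]:
        if self._cache is None:
            return self._compute(i)
        if i not in self._cache:
            self._cache[i] = self._compute(i)  # computed or recomputed on demand
        return self._cache[i]

    def evict(self, i: int) -> None:
        # Simulate losing one cached partition.
        if self._cache is not None:
            self._cache.pop(i, None)

    def collect(self) -> List[int]:
        return [x for i in range(self.num_partitions) for x in self.partition(i)]


store = BlockStore()
store.put("b0", [1, 2])
store.put("b1", [3, 4])

squares = ToyRDD.from_store(store, ["b0", "b1"]).map(lambda x: x * x).cache()

first = squares.collect()
reads_after_first = store.reads    # both blocks were read once

second = squares.collect()
reads_after_second = store.reads   # served from memory, no new reads

squares.evict(1)                   # drop one cached partition
third = squares.collect()
reads_after_third = store.reads    # only block "b1" is read again
```

In this sketch the second `collect()` never touches the store, and after the eviction only the missing partition is rebuilt. Real Spark applies the same idea at cluster scale, with scheduling, shuffles and fault handling that this toy leaves out.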