Answer Posted / Zia Ur Rehman
A PySpark DataFrame is a distributed collection of data organized into named columns. It is similar to a table in a relational database or a data frame in R, and it provides a programming interface for Spark's RDD (Resilient Distributed Datasets).
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers