Answer Posted / Neeraj P Singh
"Resilient Distributed Datasets (RDDs) are the fundamental data structure of Apache Spark. An RDD is an immutable, partitioned collection of objects that can be processed in parallel across a cluster. For example, you can create an RDD from a local collection with the `SparkContext.parallelize()` method, like so: `val rdd = sc.parallelize(Array(1, 2, 3, 4))`."
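To make the idea concrete, here is a minimal self-contained sketch that builds on the answer above: it creates a local `SparkSession` (the `local[*]` master and the `RddExample` object name are assumptions for illustration, not from the original answer), parallelizes a collection into an RDD, and runs a lazy `map` transformation followed by a `reduce` action.

```scala
import org.apache.spark.sql.SparkSession

object RddExample {
  def main(args: Array[String]): Unit = {
    // Assumed: a local session for demonstration; in a real cluster
    // the master URL would point at the cluster manager instead.
    val spark = SparkSession.builder()
      .appName("RddExample")
      .master("local[*]")
      .getOrCreate()
    val sc = spark.sparkContext

    // Create an RDD from a local collection, as in the answer above.
    val rdd = sc.parallelize(Array(1, 2, 3, 4))

    // Transformations are lazy: nothing runs until an action is called.
    val doubled = rdd.map(_ * 2)

    // An action (reduce) triggers the parallel computation.
    val total = doubled.reduce(_ + _)
    println(total)

    spark.stop()
  }
}
```

Note that `map` only records the computation; the work is distributed across partitions and executed when `reduce` is invoked.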