Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.



explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in A..

Answer / Sadhana Dubey

RDD (Resilient Distributed Dataset) is an immutable distributed collection of objects that provides fault-tolerant parallel processing for large datasets in Apache Spark. It serves as the fundamental data structure for performing computations in Spark. RDDs can be created from various sources such as local files, HDFS files, or even other RDDs using Spark's API (Application Programming Interface). Some ways to create RDDs include textFile(path), wholeTextFiles(path), and parallelize(iterable) in Scala, SparkSession.textFile(path), SparkSession.wholeTextFiles(path), and SparkSession.parallelize(iterable) in Java and Python respectively.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

What is spark good for?

1 Answers  


What is Spark Streaming?

1 Answers  


What is sc parallelize?

1 Answers  


What is full form of rdd?

1 Answers  


What is javardd?

1 Answers  


Explain about the different types of transformations on DStreams?

1 Answers  


Can we install spark on windows?

1 Answers  


Does spark require hadoop?

1 Answers  


What is the difference between persist

1 Answers  


Why is transformation lazy operation in Apache Spark RDD? How is it useful?

1 Answers  


Which language is better for spark?

1 Answers  


explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.

1 Answers  


Categories