

What is an RDD in Apache Spark? How are RDDs computed, and what are the various ways in which one can be created?




Answer / Neeraj Kumar Soni

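An RDD (Resilient Distributed Dataset) in Apache Spark is an immutable, distributed collection of data that is manipulated through transformations and actions. The data is split into smaller chunks called partitions, which are spread across the nodes of the cluster and processed in parallel. RDDs are computed lazily: transformations such as map and filter only record the lineage of the new RDD, and nothing runs until an action such as collect or count is called. An RDD can be created in three ways: by parallelizing an existing collection in the driver program (sc.parallelize), by loading an external dataset such as a file on HDFS or the local file system (sc.textFile), or by applying a transformation to an existing RDD.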
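
To make the idea of partitions and lazy computation concrete, here is a minimal sketch in plain Python. It is not Spark's API or implementation: ToyRDD and its methods are hypothetical names chosen only to mirror the Spark calls parallelize, map, filter and collect, and everything runs in a single process rather than on a cluster.

```python
class ToyRDD:
    """Hypothetical stand-in for an RDD: immutable, partitioned, lazily computed."""

    def __init__(self, partitions=None, parent=None, fn=None):
        self._partitions = partitions  # source data, set only on a root RDD
        self._parent = parent          # lineage: the RDD this one was derived from
        self._fn = fn                  # function applied to each parent partition

    @staticmethod
    def parallelize(data, num_partitions=2):
        """Create a root RDD by slicing a local collection into partitions."""
        items = list(data)
        size = max(1, -(-len(items) // num_partitions))  # ceiling division
        parts = [items[i:i + size] for i in range(0, len(items), size)]
        return ToyRDD(partitions=parts)

    def map(self, f):
        """Transformation: records the step and returns a new RDD, runs nothing."""
        return ToyRDD(parent=self, fn=lambda part: [f(x) for x in part])

    def filter(self, pred):
        """Transformation: records the step and returns a new RDD, runs nothing."""
        return ToyRDD(parent=self, fn=lambda part: [x for x in part if pred(x)])

    def _compute(self):
        """Walk the lineage back to the root and evaluate partition by partition."""
        if self._parent is None:
            return [list(p) for p in self._partitions]
        return [self._fn(p) for p in self._parent._compute()]

    def collect(self):
        """Action: triggers the computation and gathers all partitions."""
        return [x for part in self._compute() for x in part]


calls = []  # records every element that square() actually processes


def square(x):
    calls.append(x)
    return x * x


numbers = ToyRDD.parallelize(range(1, 7), num_partitions=3)   # from a collection
evens = numbers.map(square).filter(lambda x: x % 2 == 0)      # from another RDD

calls_before_collect = len(calls)  # still 0: only the lineage has been recorded
result = evens.collect()           # the action runs the whole chain
print(calls_before_collect, result)
```

In this toy model, as in Spark, a transformation never modifies the RDD it is called on; it returns a new object that remembers its parent, and that chain of parents is the lineage that gets replayed when an action is called.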


More Apache Spark Interview Questions

What is a tuple in spark?
What is dataproc cluster?
Which is the best spark certification?
Can you define rdd lineage?
What is spark lineage?
What are broadcast variables in spark?
Is apache spark going to replace hadoop?
How is Apache Spark better than Hadoop?
What is the difference between dataframe and dataset in spark?
How spark works on hadoop?
Compare Transformation and Action in Apache Spark?
What does MLlib do?
