What is a Resilient Distributed Dataset (RDD) in Apache Spark? How does it make Spark operator-rich?
Answer / Ankit Bhatnagar
A Resilient Distributed Dataset (RDD) is the fundamental data structure in Apache Spark: an immutable, partitioned, distributed collection of objects. RDDs are fault-tolerant through lineage information: each RDD records the sequence of transformations used to derive it from its parent, so if a partition is lost, Spark can recompute just that partition from the lineage instead of replicating the data. This design is also what makes Spark operator-rich: because every transformation simply produces a new immutable RDD, operators such as map(), filter(), and join() (transformations) and reduce() (an action) can be freely composed into long pipelines.
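The lineage idea can be illustrated with a small toy model in plain Python (this is an illustrative sketch only, not Spark's actual implementation; the `ToyRDD` class and its method names are invented for this example): each derived dataset remembers its parent and the transformation that produced it, so a lost partition can be rebuilt by replaying the lineage.

```python
from functools import reduce

class ToyRDD:
    """Toy model of an RDD (illustrative only, NOT Spark's real code).

    Each ToyRDD keeps its cached partitions plus lineage information:
    a reference to its parent and the transformation used to derive it.
    """
    def __init__(self, partitions, parent=None, transform=None):
        self._partitions = partitions   # list of lists (cached data)
        self.parent = parent            # lineage: where this RDD came from
        self.transform = transform      # lineage: how to derive a partition

    def map(self, f):
        # Transformations never mutate; they return a new immutable RDD.
        return ToyRDD([[f(x) for x in p] for p in self._partitions],
                      parent=self, transform=lambda p: [f(x) for x in p])

    def filter(self, pred):
        return ToyRDD([[x for x in p if pred(x)] for p in self._partitions],
                      parent=self, transform=lambda p: [x for x in p if pred(x)])

    def reduce(self, f):
        # An action: folds all elements of all partitions into one value.
        return reduce(f, [x for p in self._partitions for x in p])

    def recompute_partition(self, i):
        # Fault recovery: walk the lineage back to the source data,
        # then replay each transformation for partition i only.
        if self.parent is None:
            return self._partitions[i]
        return self.transform(self.parent.recompute_partition(i))

source = ToyRDD([[1, 2, 3], [4, 5, 6]])   # source data in two partitions
doubled = source.map(lambda x: x * 2)     # transformation
big = doubled.filter(lambda x: x > 5)     # another transformation

print(big.reduce(lambda a, b: a + b))     # 6 + 8 + 10 + 12 -> prints 36

# Simulate losing a cached partition, then rebuild it from lineage alone.
big._partitions[1] = None
print(big.recompute_partition(1))         # prints [8, 10, 12]
```

Note that in real Spark the transformations are also lazy (nothing runs until an action like `reduce()` is called), which this eager sketch does not model.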