Explain the operation transformation and action in Apache Spark RDD?
Answer / Gagandeep
Transformations are a series of operations performed on an RDD that produce a new RDD. They are lazy, meaning they are not executed immediately but stored as tasks in a task graph. Examples include map(), filter(), and groupBy(). Actions are functions that return a value other than another RDD or DataFrame. Actions trigger the execution of all previous transformations and return the final result. Examples include count(), collect(), and saveAsTextFile().
| Is This Answer Correct ? | 0 Yes | 0 No |
What does apache spark stand for?
What is in memory processing in spark?
Is apache spark part of hadoop?
What is map side join?
What is Directed Acyclic Graph(DAG)?
What are the cases where Apache Spark surpasses Hadoop?
Where is rdd stored?
Define "Transformations" in Spark
How does Apache Spark handles accumulated Metadata?
What is the use of spark sql?
When creating an RDD, what goes on internally?
Name some companies that are already using Spark Streaming?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)