Explain about transformations and actions in the context of RDDs.
Answer / Ashok Kumar Ray
Transformations in Apache Spark are lazy operations that create new datasets (RDDs) based on existing ones. They are applied to an RDD but do not trigger execution. Actions, on the other hand, are execution triggers that process and return a value back to the driver program. Examples of transformations include map(), filter(), and groupBy(). Examples of actions include count(), first(), and saveAsTextFile().
| Is This Answer Correct ? | 0 Yes | 0 No |
Is it necessary to install spark on all the nodes of a YARN cluster while running Apache Spark on YARN ?
Name the languages which are supported by apache spark and which one is most popular?
What is difference between map and flatmap in spark?
What is deploy mode in spark?
Is databricks a database?
What is javardd spark?
What are the two ways to create rdd in spark?
Is spark and hadoop same?
What can skew the mean?
What is a databricks cluster?
What are the abstractions of Apache Spark?
What role does worker node play in Apache Spark Cluster? And what is the need to register a worker node with the driver program?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)