What are accumulators and broadcast variables in Spark?
Answer / Ramesh Prasad
Accumulators in Spark are shared variables that tasks can only add to while a job runs; only the driver program can read the accumulated value. They are typically used for counters and sums, such as counting malformed records across all partitions. Broadcast variables, on the other hand, are read-only shared variables: Spark sends the value to each executor once and caches it there instead of shipping a copy with every task, which makes them a good fit for large lookup data that many tasks need.