What is spark checkpointing?
Answer / Neeraj Sharma
Spark Checkpointing is a feature that allows RDDs (Resilient Distributed Datasets) to be periodically written to stable storage during a Spark job. This helps to reduce data loss in case of worker failures and speeds up recovery.
| Is This Answer Correct ? | 0 Yes | 0 No |
How many types of Transformation are there?
What is the method to create a data frame?
Is apache spark part of hadoop?
What is a DStream?
What are the languages in which Apache Spark create API?
What is spark and what is its purpose?
Is spark better than mapreduce?
What is partitioner spark?
What operations does rdd support?
What are shared variables?
List commonly used machine learning algorithm?
Name commonly-used Spark Ecosystems
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)