What is lineage graph in spark?
Answer / Santosh Kumar Yadav
"A Lineage Graph in Spark represents the history of transformations applied on a dataset from its source to the current RDD (Resilient Distributed Datasets). It helps in recomputing data that was lost due to task failures.".
| Is This Answer Correct ? | 0 Yes | 0 No |
How is hadoop different from spark?
Can you explain benefits of spark over mapreduce?
In how many ways RDDs can be created? Explain.
What is executor memory in spark?
What is difference between coalesce and repartition?
Which are the various data sources available in spark sql?
What happens when you submit spark job?
Which are the methods to create rdd in spark?
What are the roles of the file system in any framework?
What is the use of spark sql?
What are the advantages of datasets in spark?
What is the difference between hive and spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)