Where is the Spark RDD lineage stored? According to the RDD white paper, it is kept in memory, but I want to know whether it lives on the driver or somewhere else in the cluster.
Also, regarding fault tolerance: how many copies of the RDD (metadata) are created by default?
I want to understand how this core structure behaves when we do not call the persist() method.