How does Spark process data that exceeds available memory? What is the underlying principle?

As I understand it, Spark caches data in memory and then performs its computations on that in-memory data. But what happens when the data is larger than the available memory? I could read the source code, but I don't know which class is responsible for scheduling this. Could you explain the principle of how Spark deals with this situation?

1 answer

om-nom-nom gave an answer, but only as a comment for some reason, so I decided to post it as a real answer:

https://spark.apache.org/docs/latest/scala-programming-guide.html#rdd-persistence
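In short, the linked section covers RDD persistence and storage levels, which control what Spark does with partitions that don't fit in memory. As a minimal Scala sketch (the input path and variable names here are my own placeholders, and sc is assumed to be an existing SparkContext), a storage level such as MEMORY_AND_DISK tells Spark to spill partitions that don't fit in memory to disk rather than fail:

import org.apache.spark.storage.StorageLevel

// Hypothetical input path; replace with your own data set.
val lines = sc.textFile("hdfs:///data/large-input.txt")

// MEMORY_AND_DISK: partitions that do not fit in memory are
// spilled to disk and read back from there when needed,
// instead of being recomputed on the fly (as with MEMORY_ONLY).
val cached = lines.persist(StorageLevel.MEMORY_AND_DISK)

// Trigger the computation; Spark keeps as many partitions in
// memory as it can and falls back to disk for the rest.
println(cached.count())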

