I know that databases in general can scale horizontally using master / slave replication . This is a great strategy when the number of concurrent readings grows.
As the number of concurrent records or just the amount of data starts to grow, although master / slave replication does not give you anything, so you need to split your data .
This is great for scenarios with key values. A classic example for me is TinyURL / bit.ly; reading / writing data for a short url foo can be completely independent of reading / writing data for a short string url.
But what should you do if you are in a schedule script? More specifically, is it even possible to partition a graph database, such as Neo4j ? If so, how?
I canβt wrap my head around how you could break a graph without defeating the goal of using a graph database (efficient workarounds).
source share