Sorry if this is stupid, but I just thought I should take a picture. Say I have a huge graph (for example, 100 billion nodes). Neo4J supports 32 billion, while others support more or less the same, so say that I cannot simultaneously use the entire data set in the database, can I run pagerank on it if its oriented graph (without loops) and each a set of nodes are connected to the next set of nodes (therefore, new links will not be created back, only new links are created for new datasets).
Is there a way, somehow, you can take the previous estimates for pagerank and apply them to new datasets (I only care about pagerank for the most recent dataset, but need the previous pagerank to get the latest datasets)?
It makes sense? If so, can this be done?
source share