A large number of relationship types in a cypher request

Question

A large number of relationship types in a cypher request

I am prototyping a data authorization / protection scheme in Neo4j and I am having a strange problem with one of my requests. For the background, the concept is that a user trying to get from a may be if they have the correct access identifier. So, our edges have types that have access identifiers. I am testing this circuit by creating many nodes and connecting their pairs with different accesses. That is, I have many sets:

(a)-[:ACCESS_A]->(b)

With different appeals. I ask them:

 {some query} with a match (a)-[:ACCESS_A|:ACCESS_B|<...>|:ACCESS_Z]->(b) return b

where the size of the list at the matching edge increases with the amount of access available to users.

All this works fine until the list gets access to 201. At this point, the db hits and the time spent WAY up are shown in the profile. In 200 relationship types, the profile shows 1051 dB, but 201 relationship types shows 31801. This is a 30x increase for another type! Time increases in a similar way. the transition from 199 to 200 only increases by about 50 strokes and that is due to an increase in the number of nodes.

After more detailed work, it seems that the round number 200 is more a red herring than a problem. Previously, my relationship types were 4 characters. When I changed them to 9 characters (adding "EDGE_" as a test), the problem started in 50 types - 50 has 36 hits, and 51 - 291 have a smaller jump, but significant compared to the previous increase in the same test.

There seems to be some relation-name relation to where the request falls, but I'm still researching.

Things that I tested and didn't find are of interest:

general request length (string size): it fails with completely different request sizes with 4 and 9 character relationship types
length of the list in the sentence [e: <...>] (line size). As above, it fails at very different sizes
the number of nodes or edges in the graph

+5

neo4j

Tal Feb 02 '17 at 20:58

source share

2 answers

Inversefalcon · Answer 1 · 2017-02-02T22:40:06+0000

As far as I know, you should not run into performance issues with only 200 relationship types.

Prior to version 3.0, the number of relationship types was limited to 64k. This limit has been removed with version 3.0.

Tal · Answer 2 · 2017-02-03T20:59:25+0000

I managed to find a solution to my problem. It appears that the Neo4j request for more different types of relationships than exists is causing the problem. I was able to use a lot more than 200 when all these types existed. Therefore, the solution is so that you do not request any types that are not represented on the graph.

A large number of relationship types in a cypher request

More articles: