App Engine entity deserialization in Python: is it really that slow?

While profiling my Python 2.7 App Engine app, I found that it takes an average of 7 ms per record to deserialize records fetched via ndb into Python objects (time spent in pb_to_query_result, pb_to_entity and their descendants; this does not include the RPC time to query the datastore and retrieve the raw records).
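For context, here is a minimal sketch of how I take that kind of measurement with cProfile; the Record model is a hypothetical stand-in for my real model, and only the protobuf-to-entity conversion functions are printed:

    # A minimal profiling sketch, assuming a GAE Python 2.7 runtime with ndb
    # available. `Record` is a hypothetical placeholder; the point is only to
    # show how the per-record pb_to_* cost can be isolated from the RPC time.
    import cProfile
    import pstats
    import StringIO

    from google.appengine.ext import ndb


    class Record(ndb.Model):
        # Placeholder model; the real one is described below.
        name = ndb.StringProperty()


    def profile_fetch(limit=1000):
        profiler = cProfile.Profile()
        profiler.enable()
        records = Record.query().fetch(limit)  # deserialization happens here
        profiler.disable()

        out = StringIO.StringIO()
        stats = pstats.Stats(profiler, stream=out).sort_stats('cumulative')
        # Show only the conversion functions (pb_to_query_result, pb_to_entity, ...).
        stats.print_stats('pb_to')
        return len(records), out.getvalue()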

Is this expected? My model has six properties, one of which is a LocalStructuredProperty with 15 properties, one of those in turn being a repeated StructuredProperty with four properties; even so, an average entity should contain fewer than 30 property values in total.
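Roughly, the model shape looks like the sketch below (all names are invented and most of the inner properties are omitted; the real LocalStructuredProperty model has 15 properties):

    # An abbreviated, hypothetical sketch of the model shape described above.
    from google.appengine.ext import ndb


    class Item(ndb.Model):
        # Inner repeated StructuredProperty model with four properties.
        sku = ndb.StringProperty()
        qty = ndb.IntegerProperty()
        price = ndb.FloatProperty()
        note = ndb.StringProperty()


    class Details(ndb.Model):
        # Stored as a LocalStructuredProperty; ~15 properties in the real model.
        title = ndb.StringProperty()
        created = ndb.DateTimeProperty()
        items = ndb.StructuredProperty(Item, repeated=True)


    class Record(ndb.Model):
        # Six top-level properties in the real model; abbreviated here.
        owner = ndb.StringProperty()
        category = ndb.StringProperty()
        details = ndb.LocalStructuredProperty(Details)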

Is this kind of slowdown expected? I want to fetch several thousand records to do a simple population analysis, and while I can tolerate some latency, more than 10 seconds is a problem. Is there anything I can do to restructure my models or my schema to make this more viable? (Aside from the obvious solution of precomputing my aggregate analysis on a regular basis and caching the results.)
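The precompute-and-cache workaround I'm referring to would look roughly like this; the URL, memcache key and Record model are all hypothetical, with a cron.yaml entry pointing at the recompute handler:

    # A minimal sketch of the precompute-and-cache fallback mentioned above.
    import json

    import webapp2
    from google.appengine.api import memcache
    from google.appengine.ext import ndb

    STATS_KEY = 'population-stats'


    class Record(ndb.Model):
        # Placeholder; stands in for the real model described above.
        owner = ndb.StringProperty()


    def compute_stats():
        # Pays the full deserialization cost once, off the user-facing path.
        records = Record.query().fetch()
        return {'count': len(records)}  # ...real aggregate analysis here...


    class RecomputeStatsHandler(webapp2.RequestHandler):
        def get(self):
            memcache.set(STATS_KEY, json.dumps(compute_stats()))
            self.response.write('ok')


    def get_stats():
        # Readers normally hit only memcache; recompute synchronously if cold.
        cached = memcache.get(STATS_KEY)
        if cached is not None:
            return json.loads(cached)
        stats = compute_stats()
        memcache.set(STATS_KEY, json.dumps(stats))
        return stats


    app = webapp2.WSGIApplication([('/tasks/recompute_stats', RecomputeStatsHandler)])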

If it is unusual for it to be this slow, it would be useful to know, so that I can go hunting for whatever I am doing that makes it worse.

+6
1 answer

The short answer is yes.

I find the Python deserialization to be very slow, especially where repeated properties are involved. GAE's Python deserialization creates a boatload of objects. It is known to be inefficient, but apparently nobody wants to touch it because it sits so far down the stack.

It is unfortunate. We run F4 frontend instances most of the time because of this overhead (i.e. faster CPU == faster deserialization).
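For anyone wondering what that looks like in practice, the instance class is set in app.yaml; a sketch for a Python 2.7 standard-environment app follows, where only the instance_class line matters and the rest is a typical skeleton (the script name is just an example):

    runtime: python27
    api_version: 1
    threadsafe: true
    instance_class: F4

    handlers:
    - url: /.*
      script: main.app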

+7

Source: https://habr.com/ru/post/951833/

