I know that when your Google App Engine (GAE) application works with 0 instances (since it was inactive for bits) and the user requests a page, the user has to wait for the instance to load and make all instances that can make the user wait a significant amount time.
My question is about the situation when your GAE application already has 1 instance, but begins to experience a heavy load and starts loading the second instance.
In this case, what will happen:
Will the user end up waiting until an instance of the second instance is created before responding to the request?
Will requests be sent to the second instance until it is fully created, so the user will not wait for a longer time?
EDIT: Unfortunately, the answer to this question is No. 1 at the moment. However, there is a function request to change the behavior to # 2. Please run this function request ( http://code.google.com/p/googleappengine/issues/detail?id=2690 ) to attract the attention of App developers Engine
source
share