PMF is expensive to create, so only one for each application makes sense. And getting PM per request also makes sense, since it's not expensive. Actually, it depends on what you plan to do with objects between requests, whether you want to separate them, and then reconnect to the next request.
Utilization of GAE / J applications, obviously, leads to the fact that the new PMF postpones the delay, which is necessary for Google; there was a thread on their forum that looked at timings for initializing various things, and I suggested what they could do, besides their own, so that they invested a resource in this effort.
NTN
source share