I am running an instance group of 20 preemptible GCE instances to read ORC files stored in Google Cloud Storage. The data is partitioned by hour, at about 2 GB per hour.
- What types of instances should I use?
- How much of the RAM should the JVM use?
- I use an autoscaling configuration with an 80% CPU utilization target and a 10-minute cool-down period. Are there more subtle settings I should use for Presto?
- Is there a solution for servers being shut down due to lack of resources?
Partial responses will also be appreciated.