I use PIG latin for processing logs, because its expressiveness is in a problem when the data is not large enough to worry about creating a whole cluster of haops. I run PIG in local mode, but I think that it does not use all the cores available to it (16 at the moment), CPU monitoring shows 200% maximum CPU usage.
Is there any tutorial or recommendations for fine tuning PIG for local execution? I am sure that all cartographers can use all available kernels with some easy configuration. (In my script, I already set the default_parallel parameter to 20)
Sincerely.
source share