Why does this multiprocessing.pool implementation not work?

Here is the code I'm using:

import os
import numpy as np
import sympy as sy
from multiprocessing import Pool

def initFunction(arg1, arg2):
    # Returns a nested function (closure) that captures arg1 and arg2
    def funct(value):
        return arg1 * arg2 * value
    return funct

os.system("taskset -p 0xff %d" % os.getpid())
pool = Pool(processes=4)
t = np.linspace(0, 1, 10000)  # num must be an integer, not 10e3

a,b,c,d,e,f,g,h = sy.symbols('a,b,c,d,e,f,g,h',commutative=False)

arg1 = sy.Matrix([[a,b],[c,d]])
arg2 = sy.Matrix([[e,f],[g,h]])
myFunct = initFunction(arg1, arg2)

m3 = map(myFunct,t) # this works
m4 = pool.map(myFunct,t) # this does NOT work

The error I am getting is:

Traceback (most recent call last):
   File "<stdin>", line 1, in <module>
   File "/usr/lib/python2.7/dist-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 540, in runfile
      execfile(filename, namespace)
   File "/home/justin/Research/mapTest.py", line 46, in <module>
      m4 = pool.map(myFunct,t) 
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 251, in map
      return self.map_async(func, iterable, chunksize).get()
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 558, in get
      raise self._value
cPickle.PicklingError: Can't pickle <type 'function'>: attribute lookup __builtin__.function failed

So what does this error mean and how can I multiprocess this map function?

1 answer

The objects that you pass between processes when using multiprocessing must be picklable. For a function, that means it must be importable by name from the top level of a module (here, __main__) so that it can be unpickled in the child process. Nested functions, such as funct, are not importable from __main__, so you get this error. You can achieve what you are trying to do by using functools.partial:

from multiprocessing import Pool
from functools import partial

def funct(arg1, arg2, value):
    return arg1 * arg2 * value


if __name__ == "__main__":
    t = [1,2,3,4]
    arg1 = 4 
    arg2 = 5 

    pool = Pool(processes=4)
    func = partial(funct, arg1, arg2)
    m4 = pool.map(func,t)
    print(m4)

Output:

[20, 40, 60, 80]
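
To see why the nested function fails while the partial works, you can test picklability directly, without a Pool. This is only a minimal sketch: make_nested and can_pickle are hypothetical helper names introduced for illustration, and it assumes Python 3, where pickling a local function raises either AttributeError or PicklingError depending on the version.

```python
import pickle
from functools import partial


def make_nested(arg1, arg2):
    # Hypothetical stand-in for initFunction in the question: returns a closure.
    def funct(value):
        return arg1 * arg2 * value
    return funct


def funct(arg1, arg2, value):
    # Module-level function, as in the answer above.
    return arg1 * arg2 * value


def can_pickle(obj):
    # Hypothetical helper: True if pickle.dumps(obj) succeeds, False otherwise.
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError, TypeError):
        return False


nested = make_nested(4, 5)    # closure defined inside another function
bound = partial(funct, 4, 5)  # partial wrapping a module-level function

print("nested function picklable:", can_pickle(nested))
print("partial picklable:", can_pickle(bound))
print("both compute the same value:", nested(3) == bound(3))
```

Run as a script, this should report that the closure cannot be pickled while the partial can, even though both compute the same result. pool.map has to pickle the callable to send it to the worker processes, which is why only the partial version works there.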

Source: https://habr.com/ru/post/1548286/

