The parallel package is a merger of the multicore and snow packages, but if you want to run on multiple nodes, you must use the "snow functionality" in parallel (that is, the part of parallel that was derived from snow). In practice, this means you need to call makeCluster with the "type" argument set to "PSOCK", "SOCK", "MPI", or "NWS", because those are the only cluster types supported by the current version of parallel that can execute on multiple nodes. If you are using a cluster that is managed by knowledgeable HPC system administrators, you should use "MPI"; otherwise it may be easier to use "PSOCK" (or "SOCK" if you have a specific reason to use the snow package).
If you decide to create an "MPI" cluster, you must run your R script with the mpirun command using the "-n 1" option, and set the first argument of makeCluster to the number of workers that should be spawned. (If you do not know what this means, you may not want to use this approach.)
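As a rough sketch of the MPI approach, assuming the snow and Rmpi packages are installed and a working MPI runtime is available: the function name run_mpi_demo, the worker count of 4, and the script name in the launch comment are all hypothetical and only serve as illustration.

```r
library(parallel)

# Launch the script with a single MPI process; the workers are spawned
# by makeCluster itself. For example (script name is hypothetical):
#   mpirun -n 1 R --slave -f mpi_demo.R

# Spawn `n_workers` MPI workers and ask each one for its host name.
# Assumes snow and Rmpi are installed, since type = "MPI" relies on them.
run_mpi_demo <- function(n_workers) {
  cl <- makeCluster(n_workers, type = "MPI")
  on.exit(stopCluster(cl))
  unlist(clusterCall(cl, function() Sys.info()[["nodename"]]))
}

# Inside the script started by mpirun, you would then call, for example:
# print(run_mpi_demo(4))
```

The call is left commented out because it only makes sense inside a process started by mpirun.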
If you decide to create a "PSOCK" or "SOCK" cluster, the first argument to makeCluster must be a character vector of host names, and makeCluster will start the workers on those nodes using the ssh command when it is executed. This means that you must have ssh daemons running on all of the specified nodes.
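A minimal sketch of the PSOCK approach follows. The helper name worker_hostnames and the host names "node1" and "node2" are hypothetical; for remote hosts this assumes ssh logins work without a password prompt and that R is installed on each node.

```r
library(parallel)

# Start one PSOCK worker per entry in `hosts` and report where each runs.
worker_hostnames <- function(hosts) {
  cl <- makeCluster(hosts, type = "PSOCK")
  on.exit(stopCluster(cl))
  unlist(clusterCall(cl, function() Sys.info()[["nodename"]]))
}

# Hypothetical remote hosts; listing a host twice starts two workers on it.
# worker_hostnames(c("node1", "node1", "node2"))

# Local smoke test: "localhost" entries are started on the current machine.
print(worker_hostnames(c("localhost", "localhost")))
```

Trying the function with "localhost" first is a cheap way to check the R side before debugging ssh access to the remote nodes.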
I wrote a lot more on this topic elsewhere, but hopefully this helps you get started.