You can clone or download the psutil project from the following link: https://github.com/giampaolo/psutil.git
Then run setup.py (python setup.py install) to install psutil.
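To confirm the install worked, a minimal sketch like the one below checks whether psutil is importable. The helper name psutil_installed is hypothetical, and the check only covers the interpreter that runs it, so run it with the same Python that your PySpark workers use.

```python
import importlib.util


def psutil_installed():
    """Return True if psutil can be imported by this interpreter.

    Hypothetical helper for illustration; not part of Spark or psutil.
    """
    return importlib.util.find_spec("psutil") is not None


print("psutil importable:", psutil_installed())
```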
In spark/python/pyspark/shuffle.py you can see the following code:
    def get_used_memory():
        """ Return the used memory in MB """
        if platform.system() == 'Linux':
            for line in open('/proc/self/status'):
                if line.startswith('VmRSS:'):
                    return int(line.split()[1]) >> 10
        else:
            warnings.warn("Please install psutil to have better "
                          "support with spilling")
            if platform.system() == "Darwin":
                import resource
                rss = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
                return rss >> 20
The warning is only emitted in the non-Linux branch. Therefore, I assume your OS is not Linux, which is why installing psutil is suggested.
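For comparison, here is a rough sketch of how the same number can be read through psutil when it is available, with the /proc fallback kept for Linux. The function name used_memory_mb is an assumption made for this example and is not the Spark implementation. It relies only on psutil.Process(...).memory_info().rss, which reports the resident set size in bytes.

```python
import os
import platform

try:
    import psutil  # third-party and optional; None means "not installed"
except ImportError:
    psutil = None


def used_memory_mb():
    """Return this process's resident memory in MB, or 0 if unknown.

    Hypothetical helper for illustration only.
    """
    if psutil is not None:
        # rss is in bytes, so shifting by 20 converts to MB.
        return psutil.Process(os.getpid()).memory_info().rss >> 20
    if platform.system() == "Linux":
        with open("/proc/self/status") as status:
            for line in status:
                if line.startswith("VmRSS:"):
                    # VmRSS is in kB, so shifting by 10 converts to MB.
                    return int(line.split()[1]) >> 10
    return 0


print("psutil available:", psutil is not None)
print("used memory (MB):", used_memory_mb())
```

With psutil installed, the first branch works the same way on every OS, so no platform-specific fallback is needed.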