Is there an equivalent pandas.cut () in Dask?
I am trying to use bin and group a large dataset in Python. This is a list of measured electrons with properties (positionX, positionY, energy, time). I need to group it by position X, positionY and binning in energy classes.
So far I could do this with pandas, but I would like to run it in parallel. So I'm trying to use dask.
The groupby method works very well, but, unfortunately, I have difficulty trying to collect data into energy. I found a solution using pandas.cut (), but to call compute () on an raw data set (turning it into non-parallel code) you need to call. Is there an equivalent pandas.cut () in dask, or is there another (elegant) way to achieve the same functionality?
import dask
Thanks a lot!
Y. Ac source share