When using Python Pandas to read CSV, you can specify an index column. Is this possible using Python Dask when reading a file, as opposed to setting an index after?
For example, using pandas:
df = pandas.read_csv(filename, index_col=0)
Ideally, using dask might be as follows:
df = dask.dataframe.read_csv(filename, index_col=0)
I tried
df = dask.dataframe.read_csv(filename).set_index(?)
but the index column has no name (and this seems slow).
source
share