By default, the Dask DataFrame describe method only summarizes numeric columns. According to the docs, I should be able to get descriptions of categorical columns by passing the include parameter. However,
df.describe(include=['category']).compute()
leads to
TypeError: describe() got an unexpected keyword argument 'include'
I also tried a slightly different approach:
df.select_dtypes(include=['category']).describe().compute()
and this time I get
ValueError: DataFrame contains only non-numeric data.
Could you please advise how best to summarize categorical columns in a Dask DataFrame?
grześ