Pandas crosstabs, but with values from aggregation of the third column

Question

Pandas crosstabs, but with values from aggregation of the third column

Here is my problem:

df = pd.DataFrame({'A': ['one', 'one', 'two', 'two', 'one'] ,
                   'B': ['Ar', 'Br', 'Cr', 'Ar','Ar'] ,
                   'C': [1, 0, 0, 1,0 ]})

I would like to generate something like the output of a function pd.crosstab, but the values at the intersection of the column and row should come from the aggregation of the third column:

    Ar,  Br, Cr
one 0.5 0  0
two 1  0  0

For example, there are two cases of “one” and “Ar” the corresponding values in column “C” are 1.0, we sum the values in column “C” (0 + 1) and divide by the number of values in column “C”, therefore we get (0 + 1) / 2 = 0.5. Whenever a combination is missing, we (for example, “Cr” and “one”) set it to zero. Any thoughts?

+4

python pandas aggregate

user1700890 Sep 27 '16 at 10:05

source share

2 answers

pivot_table(), aggfunc='mean' :

In [46]: df.pivot_table(index='A', columns='B', values='C', fill_value=0)
Out[46]:
B     Ar  Br  Cr
A
one  0.5   0   0
two  1.0   0   0

+5

MaxU 27 . '16 22:07

piRSquared · Accepted Answer · 2016-09-27T22:08:45+0000

groupby unstack

df.groupby(['A', 'B']).C.mean().unstack(fill_value=0)

Pandas crosstabs, but with values ​​from aggregation of the third column

More articles:

Pandas crosstabs, but with values from aggregation of the third column