A more elegant way to calculate intragroup proportions in dplyr?

Question

Given a data_frame df <- data_frame(X = c('A', 'A', 'B', 'B', 'B'), Y = c('M', 'N', 'M', 'M', 'N')), I need to create a data_frame showing that 50% of A are M, 50% of A are N, 67% of B are M, and 33% of B are N.

I have a little routine that I use for this, but it seems awful.

library(tidyverse)
df <- data_frame(X = c('A', 'A', 'B', 'B', 'B'), Y = c('M', 'N', 'M', 'M', 'N')) 
# here we go...
df %>% 
  group_by(X) %>% 
  mutate(n_X = n()) %>% 
  group_by(X, Y) %>% 
  summarise(PERCENT = n() / first(n_X))

which prints

Source: local data frame [4 x 3]
Groups: X [?]

      X     Y   PERCENT
  <chr> <chr>     <dbl>
1     A     M 0.5000000
2     A     N 0.5000000
3     B     M 0.6666667
4     B     N 0.3333333

Is there a better way to do this? Surely I'm missing something.

r dplyr tidyverse

crf Jan 24 '17 at 6:38

3 answers

Sven Hohenstein · Answer 1 · 2017-01-24T06:46:00+0000

You can use prop.table:

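df %>% 
  count(X, Y) %>%
  # group by X explicitly: newer dplyr versions do not leave the
  # count() result grouped by X, so the proportions would not be per group
  group_by(X) %>%
  mutate(PERCENT = prop.table(n))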

Result:

      X     Y     n   PERCENT
  <chr> <chr> <int>     <dbl>
1     A     M     1 0.5000000
2     A     N     1 0.5000000
3     B     M     2 0.6666667
4     B     N     1 0.3333333
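
To see what prop.table is doing inside the grouped mutate, here is a minimal base R sketch on the counts for group B alone. The vector name n_B is made up for illustration; the counts are taken from the result above.

```r
# Counts of Y within group B from the question: M = 2, N = 1.
n_B <- c(M = 2, N = 1)

# With no margin argument, prop.table() divides by the overall sum ...
prop.table(n_B)

# ... which is the same as doing the division by hand.
n_B / sum(n_B)
```

Inside a data frame grouped by X, mutate evaluates this once per group, which is why the grouping in effect at that point decides whether you get within-group or overall proportions.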

Ronak Shah · Answer 2 · 2017-01-24T07:05:00+0000

We can do this in base R with table and rowSums:

new_df <- table(df$X, df$Y)
new_df/rowSums(new_df)

#          M         N
#  A 0.5000000 0.5000000
#  B 0.6666667 0.3333333
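
As a possible variant of the same idea, prop.table accepts a margin argument, so the division by row sums can be left to it. This is only a sketch; the data are rebuilt with data.frame so that it runs without the tidyverse, and row_props is a name chosen here for illustration.

```r
# Same example data as in the question, built with base R only.
df <- data.frame(X = c("A", "A", "B", "B", "B"),
                 Y = c("M", "N", "M", "M", "N"))

tbl <- table(df$X, df$Y)

# margin = 1 divides each cell by its row total, i.e. within each X.
row_props <- prop.table(tbl, margin = 1)
row_props

# Each row should sum to 1.
rowSums(row_props)
```

The result is a table in wide form, like the one above; as.data.frame(row_props) would turn it into one row per X/Y combination if a long layout is needed.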

Sandipan Dey · Answer 3 · 2017-01-24T07:36:52+0000

Three options:

dplyr

library(dplyr)
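df %>%
  count(X, Y) %>%
  # group by X explicitly so that sum(n) is the total within each X;
  # newer dplyr versions return an ungrouped result from count()
  group_by(X) %>%
  mutate(prop = n / sum(n))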

base R

tbl <- xtabs(~X+Y, df)
as.data.frame(tbl/rowSums(tbl), responseName = "prop")

data.table

library(data.table)
DT <- data.table(df)[, .N, by = .(X,Y)]
setDT(DT)[, prop := N/sum(N), by = 'X']
DT

#   X Y N      prop
#1: A M 1 0.5000000
#2: A N 1 0.5000000
#3: B M 2 0.6666667
#4: B N 1 0.3333333
