I have two data frames:
df.1 <- data.frame(loc = c('A','B','C','C'), person = c(1,2,3,4), str = c("door / window / table", "window / table / toilet / vase ", "TV / remote / phone / window", "book / vase / car / chair"))
In this way,
loc person str 1 A 1 door / window / table 2 B 2 window / table / toilet / vase 3 C 3 TV / remote / phone / window 4 C 4 book / vase / car / chair
and
df.2 <- data.frame(loc = c('A','B','C'), str = c("book / chair / chair", " table / remote / vase ", "window"))
what gives,
loc str 1 A book / chair / car 2 B table / remote / vase 3 C window
I want to create a variable df.1$percentage , which calculates the percentages of elements in df.1$str that are in df.2$str edit by loc, or:
loc person str percentage 1 A 1 door / window / table 0.00 2 B 2 window / table / toilet / vase 0.50 3 C 3 TV / remote / phone / window 0.25 4 C 4 book / vase / car / chair 0.00
( 1 has 0/3, 2 has 2/4 matches, 3 has 1/4, and 4 has 0/4)
Thanks!
source share