Add index to adjacent equal value runs

Question

Add index to adjacent equal value runs

Is there a faster way to make a counter index than using a loop? Within continuous runs of equal values, the index must be the same. I think the cycle is very slow, especially when the data is so big.

The input and the desired output are shown for illustration.

x <- c(2, 3, 9, 2, 4, 4, 3, 4, 4, 5, 5, 5, 1)

Desired Result Counter:

 c(1, 2, 3, 4, 5, 5, 6, 7, 7, 8, 8, 8, 9)

Please note that non-contiguous runs have different indices. For instance. see desired indices of values 2 and 4

My inefficient code:

 group[1]<-1 counter<-1 for (i in 2:n){ if (x[i]==x[i-1]){ group[i]<-counter }else{ counter<-counter+1 group[1]<-counter} }

+6

performance loops r indexing counting

Reens May 19, '15 at 0:01

source share

3 answers

Using data.table , which has the rleid() function:

 require(data.table) # v1.9.5+ rleid(x) # [1] 1 2 3 4 5 5 6 7 7 8 8 8 9

+8

Arun May 19 '15 at 12:21

source share

This will work with numeric character values:

 rep(1:length(rle(x)$values), times = rle(x)$lengths) #[1] 1 2 3 4 5 5 6 7 7 8 8 8 9

You can also be more efficient by calling rle only once (about 2 times faster), and a very small speed improvement can be done using rep.int instead of rep :

 y <- rle(x) rep.int(1:length(y$values), times = y$lengths)

+6

Jota May 19 '15 at 12:27

source share

Mrflick · Accepted Answer · 2015-05-19T00:18:55+0000

If you have numerical values like this, you can use diff and cumsum to add changes to the values

 x <- c(2,3,9,2,4,4,3,4,4,5,5,5,1) cumsum(c(1,diff(x)!=0)) # [1] 1 2 3 4 5 5 6 7 7 8 8 8 9

Add index to adjacent equal value runs

More articles: