Removing consecutive characters with the Posgresql regexp_replace function

Question

Removing consecutive characters with the Posgresql regexp_replace function

Remove all duplicate repeating characters using a regular expression.

In Javascript, this works well:

txt='aaa bbb 888 bbb ccc ddd'.replace(/(?!(?!(.)\1))./g,'');

Returns 'a b 8 b c d'

How to do this using the Posgeresql regexp_replace function? This will not work:

SELECT regexp_replace('aaa bbb 888 bbb ccc ddd',E'(?!(?!(.)\\\\1)).','g');

$ psql -c "SELECT regexp_replace('aaa bbb 888 bbb ccc ddd',E'(?!(?!(.)\\1)).','g');"
     regexp_replace      
-------------------------
 aaa bbb 888 bbb ccc ddd
(1 row)

$ psql -c "SELECT regexp_replace('aaa bbb 888 bbb ccc ddd','(?!(?!(.)\1)).','g');"   
ERROR:  invalid regular expression: invalid backreference number

What am I doing wrong?

+4

regex postgresql

Ians Aug 16 '16 at 4:47

source share

1 answer

mgamba · Answer 1 · 2017-01-22T00:38:17+0000

There's a similar SO question to help you get the answer:

SELECT regexp_replace('aaa bbb 888 bbb ccc ddd', '(.)\1{1,}', '\1', 'g');
 regexp_replace 
----------------
 a b 8 b c d
(1 row)

It uses backreference to capture groups of repeated characters.

Removing consecutive characters with the Posgresql regexp_replace function

More articles: