I wrote a program to read in the Windows DNS debug log, but inside I always got some funny characters in the domain field.
The following is one example:
(13) \ xc2 \ xb5 \ xc2 \ xb1 \ xc2 \ xbe \ xc3 \ xa2p \ xc3 \ xb4 \ xc2 \ x8d (5) example (3) com (0) '
I want to replace everything \x..with?
I explicitly type \ xc2 as follows
line = '(13)\xc2\xb5\xc2\xb1\xc2\xbe\xc3\xa2p\xc3\xb4\xc2\x8d(5)example(3)com(0)'
re.sub('\\\xc2', '?', line)
result: '(13)?\xb5?\xb1?\xbe\xc3\xa2p\xc3\xb4?\x8d(5)example(3)com(0)'
But it does not work if I write as follows:
re.sub('\\\x..', '?', line)
How can I write a regex to replace them?
source
share