I received files, which, unfortunately, I can’t get information about how they were generated. I need to parse these files.
The file is fully ASCII, in addition, for one character: 0xDB (in decimal it gives 219).
Obviously (looking at the file) this symbol is a currency symbol. I know this because:
- It is imperative that these files contain a currency symbol wherever the amount is displayed.
- there are no other currency symbols (neither $, nor euro, nor anything) anywhere in the files
- every time 0xDB appears next to the quantity
I think in these files that 0xDB should represent the Euro symbol (in fact, it is very likely that this 0xDB appears where the euro symbol should appear).
The file command says this:
ISO-8859 English text, with CRLF, LF line terminators
hexdump gives the following:
00000030 71 75 61 6e 74 20 db 32 2e 36 30 0a 20 41 49 4d |quant .2.60. AIM| ^^ ^
All files are usually formatted / parsed anyway. In fact, I get all the information, except that this strange character is 0xDB.
Does anyone know what is going on? How did the currency symbol (supposedly the euro symbol) somehow become 0xDB?
This is neither ISO-8859-1 (aka ISO Latin 1), nor ISO-8859-15, because in both cases the code point 219 corresponds to "Û" (the same as the Unicode code number 219 is "LATIN CAPITAL LETTER U WITH A CIRCUMFLEX "),
It is not expanded - ASCII.