In Excel, how can I convert the contents of a cell that contains accented characters, curly quotes, etc. into either the HTML entities for those characters or a transliterated plain-text version?
We have an XLS document that contains some "high" (non-ASCII) characters. The data was pulled in through a database connection, and Excel appears to correctly handle individual cells (or rows) that are in different code pages.
When we export this data to CSV, some of the high characters are not displayed correctly. It looks like Excel uses a single encoding for the whole document (naturally) but writes each character's byte value from its original code page (which may or may not be consistent with other values in the same document).
Since Excel displays the text correctly before the export, I believe we should be able to encode the high characters as their HTML equivalents at that point and then export to CSV, thereby ensuring that the CSV is ASCII-only.
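To illustrate the kind of encoding I mean, here is a minimal sketch in Python, run outside Excel. The function name `to_html_entities` is made up for this example, and it assumes the cell text has already been read into a proper Unicode string:

```python
def to_html_entities(text: str) -> str:
    """Replace every non-ASCII character with a decimal HTML character reference."""
    # The 'xmlcharrefreplace' error handler turns each character that ASCII
    # cannot represent into &#NNNN; and leaves ASCII characters untouched.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    print(to_html_entities("caf\u00e9 \u201cquoted\u201d"))
    # prints: caf&#233; &#8220;quoted&#8221;
```

Note that this does not escape `&` or `<` that are already in the text, so a consumer that decodes the entities could not tell a literal `&#233;` from an encoded one unless those are escaped first.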
(Alternatively, we could transliterate to plain ASCII, but that seems like poor practice and is probably no easier...)
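For comparison, the transliteration route might look roughly like this in Python. This is only a sketch: `transliterate` and the `PUNCTUATION` table are hypothetical names, the table covers just a few common characters, and anything without a Unicode decomposition (for example `ß` or `ø`) is silently dropped, which is part of why it feels like poor practice:

```python
import unicodedata

# Hypothetical, incomplete mapping for punctuation that has no decomposition.
PUNCTUATION = {
    "\u2018": "'", "\u2019": "'",   # curly single quotes
    "\u201c": '"', "\u201d": '"',   # curly double quotes
    "\u2013": "-", "\u2014": "-",   # en and em dashes
}


def transliterate(text: str) -> str:
    """Lossy conversion of text to plain ASCII."""
    # Map known punctuation to ASCII look-alikes first.
    text = text.translate(str.maketrans(PUNCTUATION))
    # Split accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop everything that is still outside ASCII (the combining marks, etc.).
    return decomposed.encode("ascii", errors="ignore").decode("ascii")


if __name__ == "__main__":
    print(transliterate("caf\u00e9 \u201cquoted\u201d"))
    # prints: cafe "quoted"
```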