I am trying to parse XML in java on
DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
DocumentBuilder db = dbf.newDocumentBuilder();
Document doc = db.parse(new ByteArrayInputStream(msg.getBytes("UTF-8")));
Everything seems to be fine, too
<data>Àâü</data>
correctly understood (especially regarding German umlauts).
But when I try to make out
<dΓ€ta>xxx</dΓ€ta>
the parser throws an exception, names with umlauts in it don't seem to work:
org.w3c.dom.DOMException: WFΓ€ at org.apache.harmony.xml.dom.NodeImpl.setName(NodeImpl.java:286) at
org.apache.harmony.xml.dom.AttrImpl.<init>(AttrImpl.java:55) at
org.apache.harmony.xml.dom.DocumentImpl.createAttribute(DocumentImpl.java:324) at
org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.javββa:314) at
org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.javββa:321) at
org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.javββa:128)
source
share