I am having trouble figuring out why HXT is replacing my DTDs. Firstly, here is my input file for analysis:
<!DOCTYPE html> <html> <head> <title>foo</title> </head> <body> <h1>foo</h1> </body> </html>
and this is the result that I get:
<?xml version="1.0" encoding="US-ASCII"?> <html> <head> <title>foo</title> </head> <body> <h1>foo</h1> </body> </html>
Finally, here is a simplified version of the arrows that I use:
start (App src dest) = runX $ readDocument [ withValidate no , withSubstDTDEntities no , withParseHTML yes --, withTagSoup ] src >>> this >>> writeDocument [ withIndent yes , withSubstDTDEntities no , withOutputHTML --, withOutputEncoding "UTF-8" ] dest
I apologize for the comments - I played with various combinations of configurations. I just can't get HXT to not interfere with DTD, even with withSubstDTDEntities no , withValidate no , etc. I get a warning that HXT is ignoring my doctype declaration, but this is the only bit of discernment I have, Can someone please lend me a hand? Thank you in advance!
source share