Subject: Re: [xsl] Problem with encoding UTF-8 From: Kevin Rodgers <kevin.rodgers@xxxxxxx> Date: Wed, 15 Dec 2004 10:24:56 -0700 |
David at roamware writes: > If I take the file he sent me and in UltraEdit 32 use the UNICODE/UTF-8 -> > UTF-8 conversion option save the file and then pop it through my program, > all works fine. This ambiguous conversion is explained thus "This function > will convert the complete file from Unicode or UTF-8 (ASCII representation) > to UTF-8 (with the file internally as Unicode)" > > So I am at a bit of a loss to explain what the file format has to do with > this, the PDF exports the file with the "encoding=UTF-8" in the xml element. > Any experience of this behaviour and how to get around it? I cannot change > what the PDF exports so it will have to be a "not strict" switch or > something on the parser I suppose (couldn't find reference to such a thing > mind you.). Can you examine the differences between the two files? (I would use GNU Emacs and its M-x ediff-files command.) What does GNU recode tell you about the original file when you "convert" it with utf-8..dump (with and without the --strict option)? -- Kevin Rodgers
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [xsl] Problem with encoding UTF, Geert Josten | Thread | Re: [xsl] Problem with encoding UTF, David at roamware |
Re: [xsl] Space after <a> tag in te, Geert Josten | Date | Re: [xsl] Space after <a> tag in te, David Carlisle |
Month |