Re: [xsl] Problem with encoding UTF-8

Subject: Re: [xsl] Problem with encoding UTF-8
From: Kevin Rodgers <kevin.rodgers@xxxxxxx>
Date: Wed, 15 Dec 2004 10:24:56 -0700
David at roamware writes:
> If I take the file he sent me and in UltraEdit 32 use the UNICODE/UTF-8 ->
> UTF-8 conversion option save the file and then pop it through my program,
> all works fine. This ambiguous conversion is explained thus "This function
> will convert the complete file from Unicode or UTF-8 (ASCII representation)
> to UTF-8 (with the file internally as Unicode)"
> 
> So I am at a bit of a loss to explain what the file format has to do with
> this, the PDF exports the file with the "encoding=UTF-8" in the xml element.
> Any experience of this behaviour and how to get around it? I cannot change
> what the PDF exports so it will have to be a "not strict" switch or
> something on the parser I suppose (couldn't find reference to such a thing
> mind you.).

Can you examine the differences between the two files?  (I would use GNU
Emacs and its M-x ediff-files command.)

What does GNU recode tell you about the original file when you "convert"
it with utf-8..dump (with and without the --strict option)?

-- 
Kevin Rodgers

Current Thread