RE: [xsl] Encoding problem

Subject: RE: [xsl] Encoding problem
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Thu, 25 Aug 2005 13:11:07 +0100
> I have a xml with utf-8 encoding, I am parsing that xml and then
> converting it to pdf using FOP.
> While reading the xml file I am providing the encoding as windows-1252,

I suspect that when you say the XML file has "utf-8 encoding", you mean that
it starts with an XML declaration that specifies the encoding as being
utf-8; but that you suspect the encoding is really windows-1252.
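
One rough way to test that suspicion is to compare what the XML declaration claims with whether the bytes actually decode as UTF-8. The Python sketch below is only an illustration; the function names are my own, and the sample bytes are made up. Note that a pass proves little: a pure-ASCII file, or one that has been through a wrong conversion, is still valid UTF-8.

```python
import re

UTF8_BOM = b"\xef\xbb\xbf"

def declared_encoding(data: bytes):
    """Return the encoding named in the XML declaration, or None if absent."""
    if data.startswith(UTF8_BOM):
        data = data[len(UTF8_BOM):]
    m = re.match(rb'<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z0-9._-]+)["\']', data)
    return m.group(1).decode("ascii").lower() if m else None

def is_valid_utf8(data: bytes) -> bool:
    """True if the whole byte string decodes as UTF-8 without error."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False

# Made-up sample: declares utf-8, but the apostrophe is the single
# windows-1252 byte 0x92, which is not valid UTF-8 on its own.
sample = b'<?xml version="1.0" encoding="utf-8"?><a>it\x92s</a>'
print(declared_encoding(sample), is_valid_utf8(sample))
```

If the declaration says utf-8 and the second value is False, the declaration is wrong for that file.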

Or it could be even worse: perhaps the file started life as windows-1252,
and was then put through an iso-8859-1 to utf-8 conversion. If so, you're in
a real mess, and the best hope is to try to reverse the whole process.
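
To make that "worse" case concrete, here is a small Python sketch that simulates the damage and then undoes it. It assumes the damage was exactly one windows-1252 file run through one iso-8859-1 to utf-8 conversion; the function names are hypothetical, and a real file may have been through something different.

```python
def simulate_damage(text: str) -> bytes:
    """Write text as windows-1252, then wrongly convert it as if it were iso-8859-1."""
    raw = text.encode("cp1252")           # how the file started life
    misread = raw.decode("iso-8859-1")    # converter assumes the wrong source encoding
    return misread.encode("utf-8")        # and writes the result out as utf-8

def reverse_damage(data: bytes) -> str:
    """Undo simulate_damage by running the same steps backwards."""
    misread = data.decode("utf-8")
    raw = misread.encode("iso-8859-1")    # recovers the original windows-1252 bytes
    return raw.decode("cp1252")

# U+2019 (right single quotation mark) is byte 0x92 in windows-1252.
damaged = simulate_damage("it\u2019s")
print(damaged)                   # the apostrophe has become the two octets c2 92
print(reverse_damage(damaged))
```

The reversal raises an exception if the data contains characters above U+00FF or bytes that windows-1252 leaves undefined, which is itself a hint that the file went through some other process.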

First step is to analyze the problem: find one of the "unusual" characters
in the file and tell us (a) what the character is supposed to be, and (b)
the sequence of hex octets used to encode it.
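
If you have no hex editor to hand, something like the following Python sketch will list the octets for you. It simply reports each run of non-ASCII bytes with its offset; the function name and the file name in the comment are placeholders of mine.

```python
def non_ascii_runs(data: bytes):
    """Return (offset, hex octets) for each run of bytes >= 0x80 in data."""
    runs = []
    start = None
    for i, b in enumerate(data):
        if b >= 0x80:
            if start is None:
                start = i                 # a new run begins here
        elif start is not None:
            runs.append((start, " ".join(f"{x:02x}" for x in data[start:i])))
            start = None
    if start is not None:                 # run reaches the end of the data
        runs.append((start, " ".join(f"{x:02x}" for x in data[start:])))
    return runs

# For a real file, read it in binary mode so nothing gets decoded, e.g.
#   data = open("input.xml", "rb").read()    # "input.xml" is a placeholder
for offset, octets in non_ascii_runs(b"it\xe2\x80\x99s and it\x92s"):
    print(offset, octets)
```

As a guide to reading the result, take a right single quotation mark (U+2019): the single octet 92 means windows-1252, the three octets e2 80 99 mean correct utf-8, and the pair c2 92 means a windows-1252 file was converted as though it were iso-8859-1.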

Badly encoded XML files are not really in scope for this list, but since
it's so hard to locate where in the system an encoding problem has its root
cause, we're usually tolerant...

Michael Kay
http://www.saxonica.com/
