Subject: Re: [xsl] UTF-8 to ASCII encoding problem From: David Carlisle <davidc@xxxxxxxxx> Date: Thu, 19 May 2005 16:17:22 +0100 |
I receive documents in UTF-8 which may contain pound signs correctly encoded as &163;. I need to style these into ASCII and so my style sheet defines the output encoding - <xsl:output method="xml" indent="yes" encoding="ISO-8859-1"/>. That's latin1 not ascii (ASCII is for people who think that "pound" means "#" and does not include a pound (sterling) symbol.) the sequence EF BF BD Sounds like a bug but you may be able to use encoding="US-ASCII"/> so that the output really is ASCII, then pound signs will be output as as a character reference such as &#A3; which may work better. So long as your input file isn't being corrupted when moving between systems. For example your mail has the line on two platforms one Windows, which looks like on two platforms \226 one Windows, on my email reader which displays unprintable characters in octal. octal 226 is byte 150 which is a non printing control character in iso-8859-1 (I believe it may be something else in a windows code page) David ________________________________________________________________________ This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
[xsl] UTF-8 to ASCII encoding probl, john.watson1@xxxxxxx | Thread | RE: [xsl] UTF-8 to ASCII encoding p, Michael Kay |
Re: [xsl] Ascii end-of-file charact, Kevin Rodgers | Date | Re: [xsl] Ascii end-of-file charact, David Carlisle |
Month |