Subject: Re: [xsl] Original encoding lost after transformation From: guray@xxxxxxxxxxxxxxxxxxxx Date: Fri, 18 Feb 2005 14:36:16 +0100 |
Hmm, I think you are right. The problem must occur somewhere else. I can properly pass the 2nd transformation after specifying the ISO-8859-1 encoding when serializing the resulting DOM document to a string. However, the XML declaration still specifies UTF-8? There must be a method somewhere in the (Java) APIs that sets the correct encoding declaration instead of I assume this default one? If anyone know where I'd be grateful... Guray On Fri, Feb 18, 2005 at 11:30:58AM +0000, David Carlisle wrote: > > The encoding of the source file is interpreted by the XML parser which > just reports unicode character positions in an encoding neutral way so > xslt doesn't have any record of the original encoding. > > You can ask that it outputs in latin 1 by going > <xsl:output encoding="iso-8859-1"/> > > Note however that every XML application has to accept utf8 but support > for iso-8859-1 is an optional feature so (in theory at least) your > resulting document will be less portable. > > You say you got: > > Some transformations (depending on the source xml) now fail on the > second pass as the first pass returns data with the wrong UTF-8 declaration > above. The error message is: > > java.io.UTFDataFormatException: Invalid byte 2 of 2-byte UTF-8 sequence > > It's most unlikely that XSLT writes out a file with a bad encoding > declaration so this is probably an indication of another error > somewhere in your processing chain. > > Note that if you are doing two transformations back to back from java > (or any other similar API) you almost certainly should not be encoding > and decoding the intermediate stage at all. Most XSLT processors will > allow you to pass the result tree of the first transform directly to the > second transform without having to serialise it in some encoding and > then re-parse it. > > David > > > ________________________________________________________________________ > This e-mail has been scanned for all viruses by Star. The > service is powered by MessageLabs. For more information on a proactive > anti-virus service working around the clock, around the globe, visit: > http://www.star.net.uk > ________________________________________________________________________
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [xsl] Original encoding lost af, David Carlisle | Thread | [xsl] Muenchian grouping, António Mota |
Re: [xsl] Original encoding lost af, David Carlisle | Date | [xsl] Muenchian grouping, António Mota |
Month |