Subject: Re: [xsl] Character encoding/representation from ISO-8859-1 to UTF-8
From: "Wolfgang Laun wolfgang.laun@xxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Tue, 11 Oct 2016 19:23:15 -0000

The bytes E2 80 99 are the UTF-8 encoding of the Unicode character RIGHT SINGLE QUOTATION MARK (U+2019). Simply changing the ISO-8859-1 in the XML declaration of your XML file to UTF-8 should fix this.

On 11 October 2016 at 21:00, Bridger Dyson-Smith bdysonsmith@xxxxxxxxx <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote:

> Hi all,
>
> I'm struggling with a character encoding issue (or a character
> representation issue maybe?): I have input XML that looks like this
>
> input.xml
> <?xml version="1.0" encoding="iso-8859-1"?>
> <documents>
>   <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930â€™s.</document>
> </documents>
>
> and I would like to get it to a point where the characters are represented
> properly, i.e.
>
> output.xml
> <?xml version="1.0" encoding="UTF-8"?>
> <documents>
>   <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930’s.</document>
> </documents>
>
> Thanks to Liam's help on irc and reading through the list archives, it
> seems like an identity transform should be the right step towards getting
> the representation corrected, but something isn't working (or I have a
> misunderstanding somewhere).
>
> If I apply the following identity transform with Saxon HE 9.6.0.7 in
> oXygen 18:
> <?xml version="1.0" encoding="UTF-8"?>
> <xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
>   version="2.0">
>   <xsl:output encoding="UTF-8" indent="yes"/>
>   <xsl:template match="/"><xsl:copy-of select="/"/></xsl:template>
> </xsl:stylesheet>
>
> I get the following result:
> <?xml version="1.0" encoding="UTF-8"?>
> <documents>
>   <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930â€™s.</document>
> </documents>
>
> Could someone provide some insight into what I've done wrong here? Any
> help would be greatly appreciated.
>
> Best,
> Bridger
>
> XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list>
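
The diagnosis in the reply can be checked at the byte level. The following is only a minimal sketch in Python, not part of the original exchange: the sample string and the variable names are illustrative, and it assumes (as the reply does) that the file on disk really contains the UTF-8 bytes E2 80 99. It shows why honouring the ISO-8859-1 declaration and then serialising as UTF-8 leaves the mojibake in place, while reading the same bytes as UTF-8 gives the intended character.

```python
import unicodedata

# Assumption: the bytes actually stored in input.xml for "1930's",
# with the apostrophe saved as UTF-8 (E2 80 99).
raw = b"1930\xe2\x80\x99s"

# Reading the bytes as UTF-8 (what a corrected declaration would ask for)
# yields a single character, U+2019.
as_utf8 = raw.decode("utf-8")
quote_name = unicodedata.name(as_utf8[4])

# Reading the same bytes as ISO-8859-1 (what the current declaration asks for)
# yields three separate characters: U+00E2, U+0080, U+0099.
as_latin1 = raw.decode("iso-8859-1")

# Writing that misread text back out as UTF-8 mirrors the effect of the
# identity transform: the three wrong characters are encoded faithfully.
double_encoded = as_latin1.encode("utf-8")

# A file that has already been double-encoded can be repaired by reversing
# the wrong step: decode UTF-8, re-encode as ISO-8859-1, decode UTF-8 again.
repaired = double_encoded.decode("utf-8").encode("iso-8859-1").decode("utf-8")

print(quote_name)
print(ascii(as_latin1))
print(double_encoded)
print(ascii(repaired))
```

The transform itself is doing nothing wrong here: the parser trusts the declaration, so the correction belongs in the declaration (or in how the file is saved), not in the stylesheet.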