Subject: Re: [xsl] Character encoding/representation from ISO-8859-1 to UTF-8 From: "Bridger Dyson-Smith bdysonsmith@xxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> Date: Wed, 12 Oct 2016 14:09:21 -0000 |
All, Thank you all so much for your time and trouble. A sed -e '1s/iso-8859-1/UTF-8/' seems to correct the problem (I can't believe I didn't try this when I initially started looking over the original files, I'm usually guilty of reaching for a CLI tool) -- Steven and Wolfgang got me pointed in the right direction. I appreciate all of the helpful suggestions. Best, Bridger On Tue, Oct 11, 2016 at 5:17 PM, Steven D Majewski steve.majewski@xxxxxxxxx <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote: > > It looks to me, as Wolfgang said, that it is actually UTF-8 encoding in > your file, and you should just switch the XML encoding statement to state > that. Perhaps Oxygen is btoo smartb to use as a tool b itbs an XML aware > editor, and maybe itbs interpreting the stated XML encoding. You might try > changing the encoding outside with sed and then loading the file into > oxygen. > > ( I tried editing in emacs, and even that appeared to do something to > mangle it. May have to turn of XML mode. ) > > b Steve. > > > > On Oct 11, 2016, at 4:39 PM, Bridger Dyson-Smith bdysonsmith@xxxxxxxxx < > xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote: > > Hi Steven and thanks for the response, > > On Tue, Oct 11, 2016 at 4:04 PM, Steven D Majewski steve.majewski@gmail. > com <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote: > >> >> That doesnbt appear to be valid iso-8859-1 characters, but maybe they are >> getting transformed or mangled in email or cut and pasted into email. Can >> you do a hex dump of the file so we can see the actual byte values? >> >> >> > Here's the output of hexdump: > > hexdump example-input.xml > 0000000 3f3c 6d78 206c 6576 7372 6f69 3d6e 3122 > 0000010 302e 2022 6e65 6f63 6964 676e 223d 7369 > 0000020 2d6f 3838 3935 312d 3f22 0a3e 643c 636f > 0000030 6d75 6e65 7374 0a3e 3c09 6f64 7563 656d > 0000040 746e 543e 6568 7220 6165 696c 7974 6f20 > 0000050 2066 6874 2065 6665 6566 7463 6f20 2066 > 0000060 616e 7574 6172 206c 6576 746e 6c69 7461 > 0000070 6f69 206e 6e69 6120 7220 7365 6469 6e65 > 0000080 6974 6c61 6120 7474 6369 6320 7661 7469 > 0000090 2079 6168 2073 6562 6e65 7420 6568 7420 > 00000a0 706f 6369 6f20 2066 616d 796e 6420 6265 > 00000b0 7461 7365 6120 646e 7320 6863 6c6f 7261 > 00000c0 796c 7220 7065 726f 7374 7320 6e69 6563 > 00000d0 7420 6568 3120 3339 e230 9980 2e73 2f3c > 00000e0 6f64 7563 656d 746e 0a3e 2f3c 6f64 7563 > 00000f0 656d 746e 3e73 > 00000f6 > > > I'll try to attach it as a file. > Cheers, > Bridger > > >> On Oct 11, 2016, at 3:00 PM, Bridger Dyson-Smith bdysonsmith@xxxxxxxxx < >> xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote: >> >> input.xml >> <?xml version="1.0" encoding="iso-8859-1"?> >> <documents> >> <document>The reality of the effect of natural ventilation in a >> residential attic cavity has been the topic of many debates and scholarly >> reports since the 1930C"b,b"s.</document> >> </documents> >> >> >> XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list> >> EasyUnsubscribe (by email) >> > > XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list> > EasyUnsubscribe <http://-list/504751> (by email) > <hexdump-input-file> > > > XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list> > EasyUnsubscribe <-list/1230532> (by > email <>)
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [xsl] Character encoding/repres, Steven D Majewski st | Thread | Re: [xsl] Character encoding/repres, Soren Kuula s_kuula@ |
Re: [xsl] tokenize, Michael Kay mike@xxx | Date | [xsl] [ANN] Balisage 2017 Location , Tommie Usdin btusdin |
Month |