Re: [xsl] Unicode character decimal representation problem when copying the XML

Subject: Re: [xsl] Unicode character decimal representation problem when copying the XML
From: David Carlisle <davidc@xxxxxxxxx>
Date: Wed, 13 Oct 2010 17:09:08 +0100
On 13/10/2010 16:57, Pradnya Gawade wrote:
Thanks David. That works for unicode characters but I also have data
like 'subject&apos;s' in my xml and this change outputs such values as
'subject's'. So I guess now the problem is for ascii characters in
data. Is it possible to preserve both?

well only by using something like lexev (see Martin's answer) because no normal xml parser is going to report whether the input had a ' or an &apos; The whole point of the entities is that they are equivalent to the characters. The parser will similarly not report other differences in markup.


if you use xslt (or any xml based system) to copy

<foo
 a = 'b'
></foo>

then it may come out as

<foo a="b"/>

You are not supposed to care. If you -do- care then perhaps you should be processing the input as text using sed or perl or something rather than using an xml pipeline that is designed to not notice these syntactic differences.

David





________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs. ________________________________________________________________________


Current Thread