[xsl] Unparse-text() string contains ascii chars 29, 30 and 31

Subject: [xsl] Unparse-text() string contains ascii chars 29, 30 and 31
From: andrew welch <andrew.j.welch@xxxxxxxxx>
Date: Wed, 19 Oct 2005 16:50:27 +0100
I'm trying to process some data that's one long string delimited using
ascii characters 29, 30 and 31 (which are apparently group, record and
unit 'separator characters').

I can get access to the string using unparsed-text(), but when I
attempt to process the string using any of the string functions, e.g.:

tokenize($str, '&#29;')

or

substring-before($str, '&#31;')

...the XML parser complains that these aren't legal XML characters
(when the stylesheet itself is parsed).

Is there any way around this?  I can't see how I can process the
string in XSLT without using the characters themselves.

The two alternatives I can see are to use an XMLFilter to turn it
into XML using Java, or to go back to the source to get them to export
their data in a less archaic way...
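
For illustration, here is a minimal sketch of the Java pre-processing route, done as plain string splitting rather than a full XMLFilter. It assumes the nesting is group (29) > record (30) > unit (31) and that the whole input fits in memory. The class name SeparatorSplitter and the method parse are hypothetical, and turning the nested lists into XML elements is left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper: splits a delimited string into groups of records of units.
class SeparatorSplitter {
    // ASCII control characters used as delimiters in the data.
    static final String GROUP = String.valueOf((char) 29);   // group separator
    static final String RECORD = String.valueOf((char) 30);  // record separator
    static final String UNIT = String.valueOf((char) 31);    // unit separator

    // Assumed nesting: groups contain records, records contain units.
    static List<List<List<String>>> parse(String data) {
        List<List<List<String>>> groups = new ArrayList<>();
        // A limit of -1 keeps trailing empty fields instead of dropping them.
        for (String group : data.split(Pattern.quote(GROUP), -1)) {
            List<List<String>> records = new ArrayList<>();
            for (String record : group.split(Pattern.quote(RECORD), -1)) {
                records.add(Arrays.asList(record.split(Pattern.quote(UNIT), -1)));
            }
            groups.add(records);
        }
        return groups;
    }

    public static void main(String[] args) {
        String sample = "a" + UNIT + "b" + RECORD + "c" + GROUP + "d";
        System.out.println(parse(sample)); // prints [[[a, b], [c]], [[d]]]
    }
}
```

The nested lists could then be written out as ordinary elements, so the stylesheet never has to contain the control characters itself.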
