[xsl] Why does the tokenize() function behave strangely when I use ENTITIES and variables?

Subject: [xsl] Why does the tokenize() function behave strangely when I use ENTITIES and variables?
From: "Costello, Roger L. costello@xxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Thu, 7 Apr 2016 13:40:10 -0000
Hi Folks,

I have a stylesheet which reads a text file and tokenizes it. The token
delimiter is two consecutive newline characters (hex 0A, hex 0A).

If I use the tokenize() function like this:

	tokenize($text-file, '&#x0A;&#x0A;')

then the text file is correctly tokenized.

But if I create an entity:

<!DOCTYPE xsl:stylesheet [
    <!ENTITY line-separator     '&#x0A;'>
]>

and a variable whose value is two line-separators:

<xsl:variable name="rule-separator"
select="'&line-separator;&line-separator;'"/>

and then use the variable with the tokenize() function:

	tokenize($text-file, $rule-separator)

then the text file is not tokenized correctly. Specifically, the XSLT
processor uses two consecutive space characters (hex 20, hex 20) as the token
delimiter rather than two consecutive newline characters (hex 0A, hex 0A) as
the token delimiter.

Do you know why this is happening? How do I fix it?

/Roger

Current Thread