Re: Unicode and XSL (was substring())

Subject: Re: Unicode and XSL (was substring())
From: James Clark <jjc@xxxxxxxxxx>
Date: Tue, 08 Jun 1999 08:47:16 +0700
Kay Michael wrote:
> 
> Looking again at the definitions of character-handling functions in XSLT, I
> wonder how translate() can be specified without being able to count
> characters in a string.

Right, but the main point of controversy in the indexing issue is
whether to count 16-bit quantities or UCS characters.  However for
translate() this issue does not arise since nobody would think that the
the two parts of a surrogate pair should be translated independently.

> (The current definition, of course, is hopelessly
> informal).

It is informal, but I hope it is clear.  Strings are defined as
sequences of UCS characters (ie character means the same as it does in
XML).  Apart from the issue of what happens if the second and third
arguments are of unequal lengths, it seems well defined to me.  If there
are other cases where the intended behaviour is not clear, please report
them to the XSL editors list.

James



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread