Re: [xsl] Support for lookaround regexp in XSLT -- any time soon?

Subject: Re: [xsl] Support for lookaround regexp in XSLT -- any time soon?
From: "Imsieke, Gerrit, le-tex" <gerrit.imsieke@xxxxxxxxx>
Date: Fri, 01 Mar 2013 12:02:38 +0100
On 01.03.2013 11:40, Michael Kay wrote:
> (b) they wanted to exclude anything that didn't make sense in an
> international Unicode context (so things like word boundaries were
> immediately suspect)

If they had been concerned about what is a word constituent and what is not in a certain language, they wouldn't have included \w and \W in http://www.w3.org/TR/xmlschema-2/#cces


\w is locale-independently defined as:
[#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters)
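To make that definition concrete, here is a minimal Python sketch of the class. The function name is_xsd_word_char is made up for illustration, and the result depends on the Unicode version that ships with the Python interpreter, so treat it as an approximation of the schema definition rather than a reference implementation.

```python
import unicodedata


def is_xsd_word_char(ch: str) -> bool:
    """Return True if ch lies outside the general categories P, Z and C.

    Mirrors the XML Schema class [#x0000-#x10FFFF]-[\\p{P}\\p{Z}\\p{C}]
    by checking the first letter of the Unicode general category.
    """
    return unicodedata.category(ch)[0] not in "PZC"


if __name__ == "__main__":
    # Letters, digits, marks and symbols count as word characters;
    # punctuation (including "_"), separators and controls do not.
    for ch in ["a", "\u00e9", "5", "+", "_", " ", "\n"]:
        print(repr(ch), unicodedata.category(ch), is_xsd_word_char(ch))
```

Note that under this definition "_" (category Pc) is not a word character while "+" (category Sm) is, which differs from the \w of Perl-style regex flavours.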


So I think \b, defined as a \w-\W boundary, a \W-\w boundary, the start anchor, or the end anchor, is also perfectly well-defined and works as expected in most circumstances.
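A rough Python sketch of such a \b, built only on the category-based \w above, might look like this. It assumes the usual convention that positions outside the string count as \W, so the start or end of the string is a boundary only when the adjacent character is a \w character. The names is_word and word_boundaries are hypothetical, not part of any XSLT or XML Schema API.

```python
import unicodedata


def is_word(ch: str) -> bool:
    # Word character in the XML Schema sense: not in categories P, Z or C.
    return unicodedata.category(ch)[0] not in "PZC"


def word_boundaries(text: str) -> list[int]:
    """Return all offsets 0..len(text) where word status changes.

    Assumption: the positions before the first and after the last
    character are treated as non-word characters.
    """
    flags = [False] + [is_word(c) for c in text] + [False]
    # flags[i] describes the character left of offset i,
    # flags[i + 1] the character right of it.
    return [i for i in range(len(text) + 1) if flags[i] != flags[i + 1]]


if __name__ == "__main__":
    print(word_boundaries("ab, cd"))
    print(word_boundaries("\u00dcber-Ma\u00df"))
```

Nothing here depends on a locale: the only input is the Unicode general category of each character.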

Gerrit
