Re: [xsl] Re: backticks in regex - tales of the unexpected p

On 7-4-2014 18:35, Ihe Onwuka wrote:
> Just going by the definition of the \w class in MK's XPath 2.0
> reference - \w -> a character considered to form part of a word

Essentially, it is the Unicode standard that defines whether something
is outside of \p{C}, \p{Z} or \p{P}. And I would find it rather strange
if "accent grave" would _not_ be considered a possible part of a word,
similarly to diaeresis, breve, cedilla etc. The counterpart, the acute
accent, is categorized the same. But not apostrophe, which is often
considered an acute accent, but really isn't.
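
A minimal sketch of that definition, written in Python only because it is easy to run: the XSD regex dialect used by XPath defines \w as every character except those in \p{P}, \p{Z} and \p{C}. The helper name is_xsd_word_char is hypothetical, and Python's own re module treats \w differently, so the check goes through unicodedata instead.

```python
import unicodedata


def is_xsd_word_char(ch: str) -> bool:
    """Approximate XSD/XPath \\w: anything outside \\p{P}, \\p{Z} and \\p{C}.

    Hypothetical helper for illustration; it relies on the Unicode tables
    shipped with the Python interpreter, which may differ in version from
    those used by a given XSLT processor.
    """
    # The first letter of the general category is the major class (L, N, S, P, ...).
    return unicodedata.category(ch)[0] not in "PZC"


if __name__ == "__main__":
    # Grave accent, acute accent, apostrophe, dollar, plus, greater-than,
    # closing parenthesis, underscore.
    for ch in "`\u00b4'$+>)_":
        print(repr(ch), unicodedata.category(ch), is_xsd_word_char(ch))
```

The grave and acute accents report category Sk and the math and currency symbols Sm and Sc, so they all count as word characters under this definition, while the apostrophe (Po), the closing parenthesis (Pe) and even the underscore (Pc) do not.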

I understand the confusion: the same XSLT book you are quoting tells you
that the math and currency symbols are part of it as well. How is $, +
or > a word character? I don't know. I guess the Unicode consortium just
had to draw the line somewhere.

> So it's TS if backtick isn't a word character in your vocabulary.
> Probably neither the first or the last to get caught by that one.

Not sure what TS means. But I'm sure you are not the last to get caught
by that one. Personally, I hardly ever use \w because I find it very
hard to understand what it does and does not match. Is the following a
word? Tell`>me$45).

I find it easiest to define the subranges myself, or use the
\p{Category} syntax, which I find clearer.
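
To make the difference concrete, here is a rough Python sketch that splits the sample string from above into runs, once with the broad \w definition and once with an explicit choice of categories (letters and digits only, roughly [\p{L}\p{N}]+). Python's re module does not support \p{...}, so the categories are looked up through unicodedata; the function names are hypothetical.

```python
import unicodedata
from itertools import groupby


def xsd_word(ch: str) -> bool:
    # Approximation of XSD/XPath \w: everything except \p{P}, \p{Z}, \p{C}.
    return unicodedata.category(ch)[0] not in "PZC"


def letter_or_digit(ch: str) -> bool:
    # Explicit alternative, roughly [\p{L}\p{N}]: letters and numbers only.
    return unicodedata.category(ch)[0] in "LN"


def runs(text: str, keep) -> list[str]:
    """Return the maximal runs of consecutive characters accepted by `keep`."""
    return ["".join(group) for ok, group in groupby(text, key=keep) if ok]


if __name__ == "__main__":
    sample = "Tell`>me$45)"
    print(runs(sample, xsd_word))         # ['Tell`>me$45']
    print(runs(sample, letter_or_digit))  # ['Tell', 'me', '45']
```

With the broad definition only the closing parenthesis falls outside the match; naming the categories explicitly gives the three tokens most readers would expect.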

Cheers,
Abel

Current Thread

Re: [xsl] Re: backticks in regex - tales of the unexpected part II, (continued)
- Ihe Onwuka - 7 Apr 2014 16:35:55 -0000
- David Carlisle - 7 Apr 2014 16:50:04 -0000
- Ihe Onwuka - 7 Apr 2014 17:04:21 -0000
- Abel Braaksma (Exselt) - 7 Apr 2014 17:27:55 -0000
- Abel Braaksma (Exselt) - 7 Apr 2014 16:58:56 -0000 <=
- Michael Kay - 8 Apr 2014 17:20:41 -0000
  - Ihe Onwuka - 8 Apr 2014 17:28:59 -0000
    - Wolfgang Laun - 8 Apr 2014 18:01:30 -0000
    - Ihe Onwuka - 8 Apr 2014 18:10:33 -0000
