RE: [xsl] Unicode character blocks in strings

Subject: RE: [xsl] Unicode character blocks in strings
From: tom tom <tomxsllist@xxxxxxxxxxx>
Date: Fri, 29 May 2009 12:46:58 +0100
I don't understand why the regex \p{{IsCJKUnifiedIdeographs}} contains a double '{}' structure. Why doesn't \p{IsCJKUnifiedIdeographs} work?



----------------------------------------
> From: mike@xxxxxxxxxxxx
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Date: Tue, 26 May 2009 14:16:56 +0100
> Subject: RE: [xsl] Unicode character blocks in strings
>
>
> Try:
>
> 
> 
> 
> 
> 
> 
> 
> 
>
> Regards,
>
> Michael Kay
> http://www.saxonica.com/
> http://twitter.com/michaelhkay
>
>> -----Original Message-----
>> From: tom tom [mailto:tomxsllist@xxxxxxxxxxx]
>> Sent: 26 May 2009 14:08
>> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
>> Subject: [xsl] Unicode character blocks in strings
>>
>>
>> I have a string containing a mix of Chinese and Latin
>> characters, eg *|.Z'J%R+,H1N1,y7PZL.
>> I wish to return a nodeset containing the following kind of structure:
>>
>>
>>
>> *|.Z'J%R+,
>> H1N1
>> ,y7PZL
>>
>>
>> Where H1N1 falls into the BasicLatin unicode character block
>> and the other two strings can be categorized as CJKUnifiedIdeographs.
>>
>> Can anyone suggest the cleanest way to do this using XSLT 2?
>>
>> Tom
>>
>> _________________________________________________________________
>> View your Twitter and Flickr updates from one place !V Learn more!
>> http://clk.atdmt.com/UKM/go/137984870/direct/01/
>

_________________________________________________________________
Beyond Hotmail !X see what else you can do with Windows Live.
http://clk.atdmt.com/UKM/go/134665375/direct/01/

Current Thread