RE: [xsl] Unicode character blocks in strings

Try:

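<!-- select is required; "." assumes the context item is the string -->
<xsl:analyze-string select="." regex="\p{{IsCJKUnifiedIdeographs}}+">
<xsl:matching-substring>
  <!-- a run of consecutive CJK ideographs -->
  <out><xsl:value-of select="."/></out>
</xsl:matching-substring>
<xsl:non-matching-substring>
  <!-- everything else, e.g. the Latin run -->
  <out><xsl:value-of select="."/></out>
</xsl:non-matching-substring>
</xsl:analyze-string>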
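
As a rough sketch only, here is how the fragment above might sit in a complete XSLT 2.0 stylesheet. It assumes the mixed string is the content of a hypothetical <text> element; the names text, runs and run, and the block attribute, are made up for illustration. The braces are doubled because regex is an attribute value template, and the trailing + makes each run of consecutive ideographs a single match rather than one match per character.

```xml
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:output method="xml" indent="yes"/>

  <!-- Hypothetical input element holding the mixed CJK/Latin string -->
  <xsl:template match="text">
    <runs>
      <xsl:analyze-string select="." regex="\p{{IsCJKUnifiedIdeographs}}+">
        <!-- One or more consecutive characters from the CJK block -->
        <xsl:matching-substring>
          <run block="CJKUnifiedIdeographs"><xsl:value-of select="."/></run>
        </xsl:matching-substring>
        <!-- Any stretch of characters outside that block -->
        <xsl:non-matching-substring>
          <run block="other"><xsl:value-of select="."/></run>
        </xsl:non-matching-substring>
      </xsl:analyze-string>
    </runs>
  </xsl:template>

</xsl:stylesheet>
```

Labelling the non-matching branch "other" rather than "BasicLatin" is deliberate: that branch catches anything outside the CJK block, not only Basic Latin characters.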

Regards,

Michael Kay
http://www.saxonica.com/
http://twitter.com/michaelhkay

> -----Original Message-----
> From: tom tom [mailto:tomxsllist@xxxxxxxxxxx]
> Sent: 26 May 2009 14:08
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: [xsl] Unicode character blocks in strings
>
>
> I have a string containing a mix of Chinese and Latin
> characters, e.g. 0"8yM"<WPMH1N1Aw8PH7.
> I wish to return a nodeset containing the following kind of structure:
>
>
>
>   0"8yM"<WPM
>   H1N1
>   Aw8PH7
>
>
> Where H1N1 falls into the BasicLatin unicode character block
> and the other two strings can be categorized as CJKUnifiedIdeographs.
>
> Can anyone suggest the cleanest way to do this using XSLT 2?
>
> Tom
>

Current Thread

RE: [xsl] XSL and HTML interaction, (continued)
- Kerry, Richard - 22 May 2009 15:54:02 -0000
- Robert Koberg - 22 May 2009 15:56:58 -0000
- Martin Honnen - 22 May 2009 16:03:11 -0000
- tom tom - 26 May 2009 13:08:00 -0000
  - Michael Kay - 26 May 2009 13:17:19 -0000 <=
    - tom tom - 28 May 2009 14:00:46 -0000
    - Michael Kay - 28 May 2009 14:26:12 -0000
    - David Carlisle - 28 May 2009 14:58:33 -0000
    - tom tom - 29 May 2009 11:47:23 -0000
