RE: [xsl] Unicode character blocks in strings

Subject: RE: [xsl] Unicode character blocks in strings
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Tue, 26 May 2009 14:16:56 +0100
Try:

<xsl:analyze-string regex="\p{{IsCJKUnifiedIdeographs}}">
<xsl:matching-substring>
  <out><xsl:value-of select="."/></out>
</xsl:matching-substring>
<xsl:non-matching-substring>
  <out><xsl:value-of select="."/></out>
</xsl:non-matching-substring>
</xsl:analyze-string>

Regards,

Michael Kay
http://www.saxonica.com/
http://twitter.com/michaelhkay

> -----Original Message-----
> From: tom tom [mailto:tomxsllist@xxxxxxxxxxx]
> Sent: 26 May 2009 14:08
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: [xsl] Unicode character blocks in strings
>
>
> I have a string containing a mix of Chinese and Latin
> characters, eg 0"8yM"<WPMH1N1Aw8PH7.
> I wish to return a nodeset containing the following kind of structure:
>
>
>
>   0"8yM"<WPM
>   H1N1
>   Aw8PH7
>
>
> Where H1N1 falls into the BasicLatin unicode character block
> and the other two strings can be categorized as CJKUnifiedIdeographs.
>
> Can anyone suggest the cleanest way to do this using XSLT 2?
>
> Tom
>
> _________________________________________________________________
> View your Twitter and Flickr updates from one place (C Learn more!
> http://clk.atdmt.com/UKM/go/137984870/direct/01/

Current Thread