|
Subject: Re: [xsl] XSLT script to report Unicode characters and code blocks in file? From: David Carlisle <davidc@xxxxxxxxx> Date: Fri, 30 May 2008 12:47:03 +0100 |
Colin
> Yes. XML Schema (and hence XPath) regular expressions.
They don't help do they?
Take alpha U+0391. The UCD says that is Lu so it matches \p(Lu) but that
just tells you it's a lower case letter, it doesn't tell you it's in the
block
<block start="00370" end="003FF" name="Greek and Coptic"/>
does it? The code I pointed to in the message you replied to would take
an alpha, get its code point, and find the string "0039" as being the
first four digits of a five digit hex representation of the codepoint,
then find this block element in unicode.xml, and thus (for example) to
http://www.unicode.org/charts/PDF/U0370.pdf
which is the pdf file which has the alpha glyph example.
Actually regexp could help, you could take the block range information
and build a regexp that matches each block by generating teh required
charater range expressions, but I think it's more natural to do that as
an xpath query rather than forcing it through the regexp engine.
David
________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.
This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs.
________________________________________________________________________
| Current Thread |
|---|
|
| <- Previous | Index | Next -> |
|---|---|---|
| RE: [xsl] XSLT script to report Uni, Michael Kay | Thread | RE: [xsl] XSLT script to report Uni, Michael Kay |
| Re: [xsl] X3D XSLT - re yesterday, David Carlisle | Date | [xsl] Know repeated values, IZASKUN GUTIERREZ GU |
| Month |