Re: [xsl] XSLT script to report Unicode characters and code blocks in file?

Subject: Re: [xsl] XSLT script to report Unicode characters and code blocks in file?
From: David Carlisle <davidc@xxxxxxxxx>
Date: Thu, 29 May 2008 22:13:33 +0100
>  Has anyone invented this wheel already?

not the whole wheel but you may be able to borrow a few spokes from the
xml file and stylesheets in
http://www.w3.org/2003/entities/2007xml/

unicode.xml has some xml with the unicode block ranges, which
characters.xsl uses to make table headings in tables such as

http://www.w3.org/2003/entities/2007doc/020.html

rather than use lots of xsl:choose coding you can just take the first 5 hex
digits of the character code (zero-padded), then just pick up the last block
with a start value (lexicographically) less than or equal to that, in a
sequence pulled from this fragment of unicode.xml

   <unicodeblocks>
      <block start="00000" end="0007F" name="C0 Controls and Basic Latin"/>
      <block start="00080" end="000FF" name="C1 Controls and Latin-1 Supplement"/>
      <block start="00100" end="0017F" name="Latin Extended-A"/>
      <block start="00180" end="0024F" name="Latin Extended-B"/>
      <block start="00250" end="002AF" name="IPA Extensions"/>
...


characters.xsl has various utility functions for getting to and from
hex strings
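
A minimal XSLT 2.0 sketch of that lookup might look like the following.
It is an illustration only, not the code in characters.xsl: the f:
namespace and the functions f:hex, f:hex5 and f:block-name are
hypothetical names, and it assumes that a local copy of unicode.xml sits
next to the stylesheet, that the block elements are in no namespace, and
that they appear in ascending order of start.

```xml
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    xmlns:f="urn:example:local-functions"
    exclude-result-prefixes="xs f">

  <xsl:output method="text"/>

  <!-- Assumption: unicode.xml has been saved beside this stylesheet. -->
  <xsl:variable name="blocks"
      select="document('unicode.xml')//unicodeblocks/block"/>

  <!-- Compare by code point so the result does not depend on the
       processor's default collation. -->
  <xsl:variable name="codepoint" as="xs:string"
      select="'http://www.w3.org/2005/xpath-functions/collation/codepoint'"/>

  <!-- Hypothetical helper: non-negative integer to upper-case hex. -->
  <xsl:function name="f:hex" as="xs:string">
    <xsl:param name="n" as="xs:integer"/>
    <xsl:sequence select="
      if ($n lt 16)
      then substring('0123456789ABCDEF', $n + 1, 1)
      else concat(f:hex($n idiv 16),
                  substring('0123456789ABCDEF', ($n mod 16) + 1, 1))"/>
  </xsl:function>

  <!-- Hypothetical helper: zero-pad the hex string to 5 digits. -->
  <xsl:function name="f:hex5" as="xs:string">
    <xsl:param name="n" as="xs:integer"/>
    <xsl:variable name="h" select="f:hex($n)"/>
    <xsl:sequence
        select="concat(substring('00000', 1, 5 - string-length($h)), $h)"/>
  </xsl:function>

  <!-- Name of the last block whose start is not greater than the
       character's 5-digit hex code; empty string if none matches. -->
  <xsl:function name="f:block-name" as="xs:string">
    <xsl:param name="n" as="xs:integer"/>
    <xsl:variable name="h" select="f:hex5($n)"/>
    <xsl:sequence select="
      string(($blocks[compare(@start, $h, $codepoint) le 0])[last()]/@name)"/>
  </xsl:function>

  <!-- One line per distinct character found in the input's text nodes. -->
  <xsl:template match="/">
    <xsl:for-each select="distinct-values(string-to-codepoints(string(.)))">
      <xsl:sort select="."/>
      <xsl:value-of
          select="concat('U+', f:hex5(.), ' ', f:block-name(.), '&#10;')"/>
    </xsl:for-each>
  </xsl:template>

</xsl:stylesheet>
```

Run against an input containing the letter A, this should report it
under "C0 Controls and Basic Latin", given the fragment above. Two
limitations of the sketch: string(.) only sees text nodes, so characters
in attribute values are not counted, and the 5-digit string comparison
stops being reliable above U+FFFFF, where codes need six hex digits.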

David


