RE: [xsl] re: Generate identifier

Subject: RE: [xsl] re: Generate identifier
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Thu, 7 Jan 2010 13:39:05 -0000
For some reason Unicode doesn't classify this as a decomposable character
(heaven only knows what criteria they were using). So you'll have to
special-case it, I think.
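
A minimal XSLT 2.0 sketch of that special-casing, not a definitive implementation. The function name f:to-identifier and its namespace URI are hypothetical and do not come from this thread. It assumes the processor supports the 'NFD' form in normalize-unicode(); the spec only requires 'NFC', so check your processor. The ligature is expanded with replace() first, the remaining accented letters are decomposed, and everything outside [A-Za-z0-9] (including the combining marks that decomposition produces) is dropped.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    xmlns:f="http://example.com/hypothetical/functions"
    exclude-result-prefixes="xs f">

  <xsl:output method="text"/>

  <!-- Hypothetical helper: reduce a string to the characters [A-Za-z0-9]. -->
  <xsl:function name="f:to-identifier" as="xs:string">
    <xsl:param name="text" as="xs:string"/>

    <!-- Step 1: special-case U+00E6 and U+00C6, which have no Unicode decomposition. -->
    <xsl:variable name="expanded" as="xs:string"
        select="replace(replace($text, '&#xE6;', 'ae'), '&#xC6;', 'AE')"/>

    <!-- Step 2: canonical decomposition, e.g. U+00E0 becomes 'a' followed by U+0300.
         Assumption: the processor supports the 'NFD' normalization form. -->
    <xsl:variable name="decomposed" as="xs:string"
        select="normalize-unicode($expanded, 'NFD')"/>

    <!-- Step 3: remove every character outside [A-Za-z0-9]; this also removes
         the combining marks introduced by step 2. -->
    <xsl:sequence select="replace($decomposed, '[^A-Za-z0-9]', '')"/>
  </xsl:function>

  <!-- Usage example; run with "main" as the initial template. -->
  <xsl:template name="main">
    <xsl:value-of select="f:to-identifier('&#xE6;ther caf&#xE9;')"/>
  </xsl:template>

</xsl:stylesheet>
```

With the input above ("æther café") the function should return "aethercafe". Other Latin letters without a decomposition, such as U+00F8 (o with stroke) or U+00DF (sharp s), would be dropped silently by step 3 unless they get their own replace() call in step 1.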

Regards,

Michael Kay
http://www.saxonica.com/
http://twitter.com/michaelhkay

> -----Original Message-----
> From: Vladimir Nesterovsky [mailto:vladimir@xxxxxxxxxxxxxxxxxxxx]
> Sent: 07 January 2010 13:27
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: [xsl] re: Generate identifier
>
> Hello!
>
> Proceeding with my original question.
>
> Is there a way to decompose characters like:
> æ 'LATIN SMALL LETTER AE' (U+00E6)
>
> into separate letters?
> Are there many such characters derived from Latin (I'll be
> calling replace() if it's only one or two)?
>
> Thanks.
> --
> Vladimir Nesterovsky
> http://www.nesterovsky-bros.com/
>
>
> > I need to convert a string into an identifier.
> > Earlier I was using the following function:
> >
>
> > Now I have to build a name containing only [A-Za-z0-9].
> > My problem is that I often see characters with modifiers like:
> > 00E0 à LATIN SMALL LETTER A WITH GRAVE
> > 00E1 á LATIN SMALL LETTER A WITH ACUTE
> > 00E2 â LATIN SMALL LETTER A WITH CIRCUMFLEX
> > 00E3 ã LATIN SMALL LETTER A WITH TILDE
> > 00E4 ä LATIN SMALL LETTER A WITH DIAERESIS ...
> >
> > My questions:
> >   is it acceptable, from the perspective of a western language,
> >   to replace those characters with a character without modifier;
> >   is there a way to do this in xslt;
> >   any better option?
