Re: [xsl] breaking string into substrings or arrays (XSLT 1.0 solution) tokenize

Subject: Re: [xsl] breaking string into substrings or arrays (XSLT 1.0 solution) tokenize
From: Dimitre Novatchev <dnovatchev@xxxxxxxxx>
Date: Thu, 21 Apr 2005 06:04:47 +1000
On 4/21/05, RahilQ <qamar_rahil@xxxxxxxxxxx> wrote:
> Hi
>
> If I want to recognize a sequence of letters and characters such as
> 'atnnnn' where 'at' are letters and 'nnnn' are numbers between 0-9 what
> would be the best way to write a token for it ?
>
> Would it be 'at0-9' or something else ?

See the other thread in this list discussing exactly this topic.

>
> Possibly trivial but could you please explain what the expression &#xA;
> in your token below signifies? Is the actual expression this or '#xA;' ?

The actual expression is exactly as given. &#xA; is the character
reference for linefeed.


Cheers,
Dimitre Novatchev

>
> Thanks
> Rahil
>
>
> Dimitre Novatchev wrote:
>
> >If you set the delimiters to:
> >
> >    "'.:;,&#xA;'"
> >
> >then the result is:
> >
> ><word>good morning</word>
> ><word> can you help me</word>
> ><word> alpha </word>
> ><word> alpha beta</word>
> ><word/>
> >
> >If you want, you may additionally normalize-space() in a second pass.
> >
> >
> >Cheers,
> >Dimitre Novatchev
> >
> >
> >
> >On 4/20/05, Ali Choumane <ali.choumane@xxxxxxx> wrote:
> >
> >
> >>Hi
> >>Thanks for your help, but in my project, if I have this entry:
> >><?xml version="1.0"?>
> >><paragraph>
> >>good morning; can you help me: alpha , alpha beta.
> >></paragraph>
> >>
> >>I would have such result:
> >><?xml version="1.0"?>
> >><sentence>good morning</sentence>
> >><sentence>can you help me<sentence>
> >><sentence>alpha<sentence>
> >><sentence>alpha beta<sentence>
> >>
> >>but I think ,the solutions you provided give an output as:
> >><?xml version="1.0"?>
> >><sentence>good</sentence>
> >><sentence>morning</sentence>
> >><sentence>can</sentence>
> >><sentence>you</sentence>
> >><sentence>help</sentence>
> >><sentence>me<sentence>
> >><sentence>alpha<sentence>
> >><sentence>alpha</sentence>
> >><sentence>beta<sentence>
> >>
> >>can you help me?
> >>
> >>
> >>Quoting "Pawson, David" <David.Pawson@xxxxxxxxxxx>:
> >>
> >>
> >>
> >>>    -----Original Message-----
> >>>    From: Dimitre Novatchev
> >>>
> >>>    Please, note that the str-split-to-words template generally
> >>>    handles *a
> >>>    set* of delimiters, not just one delim character.
> >>>
> >>>    Therefore, one can set the pDelimiters parameters say to:
> >>>
> >>>    "  ,;?\!"
> >>>
> >>>    and it will retrieve words that are delimited by any pair of these.
> >>>
> >>>
> >>>which is exactly what was asked for earlier! Nice one.
> >>>   I've never needed multiple separators before.
> >>>This guy did.
> >>>regards DaveP
> >>>
> >>>--
> >>>DISCLAIMER:
> >>>
> >>>NOTICE: The information contained in this email and any attachments is
> >>>confidential and may be privileged.  If you are not the intended
> >>>recipient you should not use, disclose, distribute or copy any of the
> >>>content of it or of any attachment; you are requested to notify the
> >>>sender immediately of your receipt of the email and then to delete it
> >>>and any attachments from your system.
> >>>
> >>>RNIB endeavours to ensure that emails and any attachments generated by
> >>>its staff are free from viruses or other contaminants.  However, it
> >>>cannot accept any responsibility for any  such which are transmitted.
> >>>We therefore recommend you scan all attachments.
> >>>
> >>>Please note that the statements and views expressed in this email and
> >>>any attachments are those of the author and do not necessarily represent
> >>>those of RNIB.
> >>>
> >>>RNIB Registered Charity Number: 226227
> >>>
> >>>Website: http://www.rnib.org.uk
> >>>
> >>>
> >>>
> >>Ali CHOUMANE
> >>M2R III ,Univ Joseph fourrier, UFR IMA
> >>WAM/INRIA & GETA/CLIPS
> >>portable.      06 71 55 80 65
> >>Bureau(INRIA). 04 76 61 54 13
> >>Email. ali.choumane@xxxxxxx
> >>
> >>-------------------------------------------------
> >>envoyi via Webmail/IMAG !

Current Thread