Re: [xsl] breaking string into substrings or arrays (XSLT 1.0 solution) tokenize

Subject: Re: [xsl] breaking string into substrings or arrays (XSLT 1.0 solution) tokenize
From: RahilQ <qamar_rahil@xxxxxxxxxxx>
Date: Wed, 20 Apr 2005 15:05:29 +0100

If I want to recognize a sequence of letters and characters such as 'atnnnn' where 'at' are letters and 'nnnn' are numbers between 0-9 what would be the best way to write a token for it ?

Would it be 'at0-9' or something else ?

Possibly trivial but could you please explain what the expression &#xA; in your token below signifies? Is the actual expression this or '#xA;' ?


Dimitre Novatchev wrote:

If you set the delimiters to:


then the result is:

<word>good morning</word>
<word> can you help me</word>
<word> alpha </word>
<word> alpha beta</word>

If you want, you may additionally normalize-space() in a second pass.

Cheers, Dimitre Novatchev

On 4/20/05, Ali Choumane <ali.choumane@xxxxxxx> wrote:

Thanks for your help, but in my project, if I have this entry:
<?xml version="1.0"?>
good morning; can you help me: alpha , alpha beta.

I would have such result:
<?xml version="1.0"?>
<sentence>good morning</sentence>
<sentence>can you help me<sentence>
<sentence>alpha beta<sentence>

but I think ,the solutions you provided give an output as:
<?xml version="1.0"?>

can you help me?

Quoting "Pawson, David" <David.Pawson@xxxxxxxxxxx>:

   -----Original Message-----
   From: Dimitre Novatchev

   Please, note that the str-split-to-words template generally
   handles *a
   set* of delimiters, not just one delim character.

Therefore, one can set the pDelimiters parameters say to:

" ,;?\!"

and it will retrieve words that are delimited by any pair of these.

which is exactly what was asked for earlier! Nice one. I've never needed multiple separators before. This guy did. regards DaveP


NOTICE: The information contained in this email and any attachments is
confidential and may be privileged.  If you are not the intended
recipient you should not use, disclose, distribute or copy any of the
content of it or of any attachment; you are requested to notify the
sender immediately of your receipt of the email and then to delete it
and any attachments from your system.

RNIB endeavours to ensure that emails and any attachments generated by
its staff are free from viruses or other contaminants.  However, it
cannot accept any responsibility for any  such which are transmitted.
We therefore recommend you scan all attachments.

Please note that the statements and views expressed in this email and
any attachments are those of the author and do not necessarily represent
those of RNIB.

RNIB Registered Charity Number: 226227


M2R III ,Univ Joseph fourrier, UFR IMA
portable.      06 71 55 80 65
Bureau(INRIA). 04 76 61 54 13
Email. ali.choumane@xxxxxxx

envoyi via Webmail/IMAG !

Current Thread