Subject: Re: [xsl] Re: text() word lists From: David Carlisle <davidc@xxxxxxxxx> Date: Mon, 9 Feb 2004 10:55:13 GMT |
So what is the best way to parameterise these to allow turning on/off the removal of numbers? And while we're at it, turning on/off the removal of hyphens or other possibly-word-forming characters? The second argument to tokenize which is what is being used to specify the "inter word space/punctuation"can include or not the numbers, or hyphens etc, it is a general string valued Xpath so in particular you can make up the regexp on the fly using concat() or string-join() passing in some parameters as needed. taokenize(.,concat('(',$space,'|[$punct,$nums,$other,'])+')) then you can set <xsl:param name="space" select="'\s'"/> <xsl:param name="punct" select="'!.,;:\?'"/> <xsl:param name="nums" select="''"/> <!-- or '0-9' --> <xsl:param name="other" select="''"/> <!-- or 'whatever you want ' --> -- http://www.dcarlisle.demon.co.uk/matthew ________________________________________________________________________ This e-mail has been scanned for all viruses by Star Internet. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________ XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
RE: [xsl] Re: text() word lists, James Cummings | Thread | [xsl] converting flat xml data into, David Buddrige |
RE: [xsl] converting flat xml data , Andreas L. Delmelle | Date | RE: [xsl] converting flat xml data , Andreas L. Delmelle |
Month |