RE: [xsl] java Regex call

Subject: RE: [xsl] java Regex call
From: David.Pawson@xxxxxxxxxxx
Date: Thu, 10 Jul 2003 11:31:47 +0100
Just curious Jeni,
Using: 

<xsl:output method="xml" indent="yes"/>
  <xsl:variable name="regex" select="'(([^_]*)_PARA)'"/>
  <xsl:variable name="input" select="'ABC_PARA__PARA'"/>
  <xsl:template match="/">
   <xsl:analyze-string select="$input" regex="{$regex}">
      <xsl:matching-substring>
        <RULE_MATCH>
          Input    <xsl:value-of select="$input" />
        </RULE_MATCH>
        <REGEX><xsl:copy-of select="$regex" /></REGEX>
        <GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
        <GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
        <GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
.
.
      </xsl:matching-substring>
      <xsl:non-matching-substring>
        <mismatch><xsl:value-of select="."/></mismatch>
      </xsl:non-matching-substring>
    </xsl:analyze-string>

  </xsl:template>

produces

<?xml version="1.0" encoding="UTF-8"?>
<RULE_MATCH>
          Input    ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>ABC_PARA</GROUP1>
<GROUP2>ABC</GROUP2>
<GROUP3/>
.
.
      <mismatch>_</mismatch>
<RULE_MATCH>
          Input    ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>_PARA</GROUP1>
<GROUP2/>
<GROUP3/>
.
.

Why are there two lots of output please? 

regards DaveP


> -----Original Message-----
> From: Jeni Tennison 
> If you're using Saxon 7.6, why not use the built-in regular expression
> support rather than Java extensions?
> 
> > Example: if my regex is defined as:-
> >
> >  (([^_]*)_PARA)|((.*?)(PARA)(.*?))
> >
> >  and my input is "ABC_PARA"
> >
> >  Then I need to know what portions of the input matched
> >  each (if any) part of the regex groups, ([^_]*) and (.*?) etc,.
> >  in terms of group number and matching string.
> 
> If you use <xsl:analyze-string> then within the
> <xsl:matching-substring>, the regex-group() function gives you access
> to the values of the regex groups. Try:
> 
>   <xsl:for-each select="$rule//Type">
>     <xsl:variable name="tdlType" select="." />
>     <xsl:analyze-string select="." regex="{$regex}">
>       <xsl:matching-substring>
>         <RULE_MATCH>
>           <xsl:value-of select="$rule//@name" />
>         </RULE_MATCH>
>         <REGEX><xsl:copy-of select="$regex" /></REGEX>
>         <GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
>         <GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
>         <GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
>         ...
>       </xsl:matching-substring>
>     </xsl:analyze-string>
>   </xsl:for-each>
> 
> Of course you need to know what regular expression is being used to be
> able to tell which regex groups you're actually interested in...
> 
> By the way, I'm not sure why you're using "//" in $rule//@name -- this
> will return all the name attributes of all the elements that are
> descendants of $rule, as well as of $rule itself, and return them in a
> whitespace-separated list (this being XSLT 2.0). Is that what you
> really want?
> 
> Cheers,
> 
> Jeni
> 
> ---
> Jeni Tennison
> http://www.jenitennison.com/
> 
> 
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
> 

- 

NOTICE: The information contained in this email and any attachments is 
confidential and may be legally privileged. If you are not the 
intended recipient you are hereby notified that you must not use, 
disclose, distribute, copy, print or rely on this email's content. If 
you are not the intended recipient, please notify the sender 
immediately and then delete the email and any attachments from your 
system.

RNIB has made strenuous efforts to ensure that emails and any 
attachments generated by its staff are free from viruses. However, it 
cannot accept any responsibility for any viruses which are 
transmitted. We therefore recommend you scan all attachments.

Please note that the statements and views expressed in this email 
and any attachments are those of the author and do not necessarily 
represent those of RNIB.

RNIB Registered Charity Number: 226227

Website: http://www.rnib.org.uk 

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread