Re: [xsl] Need XPath 2.0 expression which returns a non-empty paragraph element that is preceded by a long uninterrupted series of empty paragraph elements

Subject: Re: [xsl] Need XPath 2.0 expression which returns a non-empty paragraph element that is preceded by a long uninterrupted series of empty paragraph elements
From: "Michael Kay mike@xxxxxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Tue, 26 Nov 2019 01:12:26 -0000
Does it have to be XPath or will XSLT do?

<xsl:for-each-group select="body/*" group-ending-with="p[not(. = '&#x160')]">
   <xsl:sequence select="current-group()[last()[. gt 20]][not(. =
'&#x160')]"/>
<xsl:for-each-group>

Michael Kay
Saxonica

> On 25 Nov 2019, at 19:39, Costello, Roger L. costello@xxxxxxxxx
<xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote:
>
> Hi Folks,
>
> I want to know if an XHTML document contains a non-empty paragraph (p)
element that is preceded by a long, uninterrupted series of paragraph
elements, each containing just a non-blocking space character (decimal 160).
Let's assume that "long" means 20. For example, here is an excerpt of an XHTML
document:
>
> <body>
>    <p>Text at top</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>Text at bottom</p>
> </body>
>
> The query should return this:
>
> <p>Text at bottom</p>
>
> because it is preceded by a long, uninterrupted series of paragraph
elements, each containing just a non-blocking space character.
>
> Here's a query that returns the desired paragraph element:
>
> //p[string-length(.) gt 1][count(preceding-sibling::p[. eq '&#160;']) ge
20]
>
> However, if I insert a non-empty paragraph element in the middle of that
long series:
>
> <body>
>    <p>Text at top</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>Other text</p>  <------------
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>&#160;</p>
>    <p>Text at bottom</p>
> </body>
>
> then my query erroneously returns the same paragraph element. That is, my
XPath query does not account for the requirement that the long series of
paragraph elements be uninterrupted. How to write an XPath 2.0 query for
this?
>
> /Roger

Current Thread