Re: [xsl] text extraction

> <E1> text1 <E2> text2 </E2> text3 </E1>

> I want to have something like:
> text1 text2 text3

Others in the thread have pointed out that you can rely on XSLT's
built-in template rules, which copy text nodes through to the output.
For your example XML, a stylesheet with no templates of its own would
emit the text you wanted (with the source whitespace left as-is):

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
  <xsl:output method="text"/>
</xsl:stylesheet>
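
For reference, here is roughly what the processor does on your behalf
when the stylesheet is empty. This is a paraphrase of the built-in
rules for the default mode, written out as ordinary templates; it is a
sketch for illustration, not something you need to add yourself:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
  <xsl:output method="text"/>
  <!-- Document node and elements: recurse into the children -->
  <xsl:template match="/|*">
    <xsl:apply-templates/>
  </xsl:template>
  <!-- Text and attribute nodes: copy the string value through -->
  <xsl:template match="text()|@*">
    <xsl:value-of select="."/>
  </xsl:template>
  <!-- Comments and processing instructions: emit nothing -->
  <xsl:template match="comment()|processing-instruction()"/>
</xsl:stylesheet>
```

Nothing here normalizes whitespace, so the spaces around text1, text2
and text3 in the source come through unchanged.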

But if your markup were more complicated, with embedded elements
inside E1 whose text you wanted to ignore, you could walk every node
in the document and emit the normalized string only for text nodes
whose parent is E1 or E2:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
  <xsl:output method="text"/>
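  <!-- Default: emit nothing for the node itself, just visit its children -->
  <xsl:template match="node()">
    <xsl:apply-templates select="node()"/>
  </xsl:template>
  <!-- Text directly inside E1 or E2: emit it with whitespace normalized -->
  <xsl:template match="text()[parent::*[self::E1|self::E2]]">
    <xsl:sequence select="normalize-space(.)"/>
  </xsl:template>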
</xsl:stylesheet>

That would let you handle, for example, input like the following,
where the text inside baz and flober is dropped:

<V><E1>text<E2>text2<baz>smorth</baz></E2>text3<flober>chum</flober></E1></V>
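
If you would rather not depend on how adjacent strings are joined in
the result, a pull-style variant is another option. The sketch below
is an alternative, not the approach above: it selects the wanted text
nodes in one expression and joins them with an explicit separator. It
assumes an XSLT 2.0 processor, since it uses the separator attribute
and a function call as the last step of a path:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
  <xsl:output method="text"/>
  <xsl:template match="/">
    <!-- Select non-blank text nodes whose parent is E1 or E2,
         normalize each one, and join them with a single space -->
    <xsl:value-of
      select="//text()[parent::E1 or parent::E2][normalize-space(.)]/normalize-space(.)"
      separator=" "/>
  </xsl:template>
</xsl:stylesheet>
```

For the sample input above this should give "text text2 text3". The
[normalize-space(.)] predicate skips whitespace-only text nodes, so
indentation in the source does not produce doubled separators.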

Jim

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
James A. Robinson                       jim.robinson@xxxxxxxxxxxx
Stanford University HighWire Press      http://highwire.stanford.edu/
+1 650 7237294 (Work)                   +1 650 7259335 (Fax)

Current Thread

[xsl] text extraction
- mus47 - Thu, 12 Oct 2006 14:35:09 +0200 (CEST)
  - Colin Adams - Thu, 12 Oct 2006 13:40:51 +0100
  - Florent Georges - Thu, 12 Oct 2006 14:41:32 +0200 (CEST)
  - James A. Robinson - Thu, 12 Oct 2006 06:50:07 -0700 <=
    - James A. Robinson - Thu, 12 Oct 2006 07:09:57 -0700
      - David Carlisle - Thu, 12 Oct 2006 15:17:15 +0100
      - Andrew Welch - Thu, 12 Oct 2006 15:29:19 +0100
      - Florent Georges - Thu, 12 Oct 2006 16:32:23 +0200 (CEST)
