Re: [xsl] Question on duplicate node elimination

> But how could the algorithm step of "duplicate elimination" be done?
> How can the duplicates be determined and removed, correctly?

What makes you think it would be difficult?

Of course, a processor needs some way to decide whether two nodes are identical/distinct. Given such a mechanism, it's not difficult to come up with algorithms that eliminate duplicate nodes.

In practice, when XPath 1.0 is used as part of XSLT 1.0, the XPath requirement to eliminate duplicates can always be combined with the XSLT requirement to deliver the node-set sorted in document order. So the natural way to eliminate duplicates is as part of the sorting process.
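As a rough illustration of folding duplicate elimination into the sort, here is a minimal Python sketch. It is not how any particular processor does it: the `Node` class, the `order` field and both function names are invented for the example, and node identity is modelled simply as Python object identity. Once the candidates are sorted by document-order position, duplicates are adjacent, so one pass removes them.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(eq=False)  # eq=False: two nodes are "the same" only if they are the same object
class Node:
    name: str
    children: List["Node"] = field(default_factory=list)
    order: int = -1  # hypothetical document-order position, assigned below


def number_in_document_order(root: Node) -> None:
    """Assign each node its pre-order position, which is its document order."""
    counter = 0
    stack = [root]
    while stack:
        node = stack.pop()
        node.order = counter
        counter += 1
        # Push children reversed so the first child is numbered first.
        stack.extend(reversed(node.children))


def sort_and_dedupe(nodes: List[Node]) -> List[Node]:
    """Sort into document order; after sorting, duplicates sit next to each other."""
    result: List[Node] = []
    for node in sorted(nodes, key=lambda n: n.order):
        if not result or result[-1] is not node:
            result.append(node)
    return result


# Tiny tree: a has children b and c.
b = Node("b")
c = Node("c")
a = Node("a", [b, c])
number_in_document_order(a)

# A candidate list that is out of order and contains repeats.
candidates = [c, b, c, a, b]
cleaned = sort_and_dedupe(candidates)
```

With the tree above, `cleaned` holds `a`, `b`, `c` once each, in document order.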

For performance, the most important technique is static analysis to identify those path expressions where the sort (and duplicate elimination) are unnecessary. For example, this is the case for the expression /a/b/c if it is evaluated either (a) using nested loops, or (b) by scanning the entire source document looking for nodes that match this pattern. For the expression //x//y, a sort is necessary if the evaluation uses nested loops, but not if it uses a whole-document scan and pattern matching. Remember that the evaluation techniques used internally may be very different from the descriptions you find in explanations of the semantics of the language.
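To make the //x//y case concrete, here is a small Python sketch comparing the two evaluation strategies on a toy tree. The `Node` class and the function names are assumptions made for the example, and `root` stands in for the document node; real processors are organised quite differently. The nested-loop version visits a `y` once for every `x` ancestor, so its raw output needs the sort and duplicate elimination. The single scan visits each node once in document order, so its output is already clean.

```python
class Node:
    """Hypothetical minimal element node: a name plus ordered children."""

    def __init__(self, name, *children):
        self.name = name
        self.children = list(children)


def descendants(node):
    """Yield all descendants of node in document order."""
    for child in node.children:
        yield child
        yield from descendants(child)


def nested_loops(root):
    """Naive //x//y: for every x, collect every y beneath it."""
    out = []
    for x in descendants(root):
        if x.name == "x":
            for y in descendants(x):
                if y.name == "y":
                    out.append(y)
    return out  # may contain duplicates; not guaranteed to be in document order


def single_scan(root):
    """One document-order pass: keep each y that has some x ancestor."""
    out = []

    def walk(node, inside_x):
        for child in node.children:
            if child.name == "y" and inside_x:
                out.append(child)
            walk(child, inside_x or child.name == "x")

    walk(root, False)
    return out  # each node visited once, in document order


# <root><x><x><y/></x><y/></x></root>
y1 = Node("y")
y2 = Node("y")
inner_x = Node("x", y1)
outer_x = Node("x", inner_x, y2)
root = Node("root", outer_x)

looped = nested_loops(root)
scanned = single_scan(root)
```

Here `looped` contains `y1` twice, because it lies under both `x` elements, while `scanned` contains `y1` and `y2` once each.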

The way you have phrased the question suggests that you might be worrying about how exslt:node-set() affects the process. Simple answer - it doesn't.

Michael Kay
Saxonica

Current Thread

[xsl] Question on duplicate node elimination
- Hermann Stamm-Wilbrandt - 22 Aug 2010 21:25:23 -0000
  - Michael Kay - 22 Aug 2010 21:53:45 -0000
    - Hermann Stamm-Wilbrandt - 22 Aug 2010 22:12:58 -0000
      - Michael Kay - 22 Aug 2010 22:36:14 -0000 <=
      - Hermann Stamm-Wilbrandt - 23 Aug 2010 08:24:02 -0000
      - Michael Kay - 23 Aug 2010 09:58:07 -0000
      - Hermann Stamm-Wilbrandt - 23 Aug 2010 12:36:56 -0000
      - Lars Huttar - 23 Aug 2010 20:54:24 -0000
