Re: [xsl] Max size?

Michael Kay wrote:

I can't speak with any authority about Xalan, but my understanding is
that it builds the tree concurrently with doing the transformation; by
the time you've finished, you will normally have the complete tree in
memory. This has benefits, but the memory you need still increases
linearly with document size.

I don't think so. Once incremental processing is enabled,
and provided the style sheet fits, you can process much
larger inputs without running out of memory. Well, it still
runs out of memory ultimately, but I don't think it's only
streaming the result; significant parts of the information
associated with the input seem to be discarded too.

The interesting challenge is to work out when you can discard parts of
the tree that won't be needed again. I think this could be done quite
easily for a small class of very simple stylesheets, but the general
problem is quite hard.


I think it should be possible to assert by static analysis
whether a certain template only accesses descendants of the
context node. If this can be asserted for all templates in
the style sheet, and if you can arrange the processing
within a template so that nodes are only accessed once
locally, you can discard nodes processed by directly called
templates from memory.

Making such assertions shouldn't be that hard if the XPath
expressions within the templates use only nodes from the
descendant-or-self axis. It may be an indication that Xalan's
memory usage increases quite a bit for the same input if the
stylesheet uses a sibling axis somewhere, even if the same
result is produced.

One of the more interesting questions: If you have a schema
for the input and can afford to barf in mid-processing if
for the input and can afford to barf in mid-processing if
the input doesn't validate, the structure information should
allow far better assertions from static analysis.
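
To make the idea a bit more concrete, here is a rough sketch of
such a check. It is purely illustrative and has nothing to do with
how Xalan decides anything: the function names, the list of
"escaping" axes and functions, and the sample style sheet are all
my own assumptions. It only does a crude lexical scan of select
and test attributes inside each template, ignores variables and
attribute value templates, and errs on the side of rejecting.

```python
import re
import xml.etree.ElementTree as ET

XSL_NS = "http://www.w3.org/1999/XSL/Transform"

# String literals are blanked out first so that text such as '..' inside
# quotes does not trigger a false alarm.
LITERAL = re.compile(r"'[^']*'|\"[^\"]*\"")

# Axes and functions that can reach outside the subtree of the context
# node. This list is an assumption made for the sketch.
NON_DOWNWARD = re.compile(
    r"(?<![\w-])(?:parent|ancestor|ancestor-or-self|following|"
    r"following-sibling|preceding|preceding-sibling)\s*::"
    r"|\.\."                                   # abbreviated parent step
    r"|(?<![\w-])(?:key|id|document)\s*\("     # may jump anywhere
)

# A path that starts at the document root: '/' at the start of the
# expression or right after an operator, bracket, or whitespace.
ABSOLUTE = re.compile(r"(?:^|[\[(,|=<>\s])/")


def is_downward_only(expr):
    """True if expr (lexically) stays within the context node's subtree."""
    cleaned = LITERAL.sub("''", expr).strip()
    return not (NON_DOWNWARD.search(cleaned) or ABSOLUTE.search(cleaned))


def templates_leaving_subtree(stylesheet_xml):
    """Return (template label, expression) pairs that fail the check."""
    root = ET.fromstring(stylesheet_xml)
    offenders = []
    for template in root.iter("{%s}template" % XSL_NS):
        label = template.get("match") or template.get("name") or "?"
        for node in template.iter():
            # Only XSLT instructions are inspected, not literal result
            # elements that happen to carry a 'select' attribute.
            if not node.tag.startswith("{%s}" % XSL_NS):
                continue
            for attr in ("select", "test"):
                expr = node.get(attr)
                if expr is not None and not is_downward_only(expr):
                    offenders.append((label, expr))
    return offenders


# Hypothetical style sheet: the second template looks at a sibling.
SAMPLE = """
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:template match="chapter">
    <xsl:value-of select="title"/>
    <xsl:apply-templates select="section"/>
  </xsl:template>
  <xsl:template match="section">
    <xsl:value-of select="preceding-sibling::section[1]/title"/>
    <xsl:apply-templates select=".//para"/>
  </xsl:template>
</xsl:stylesheet>
"""

if __name__ == "__main__":
    for label, expr in templates_leaving_subtree(SAMPLE):
        print("template %r leaves its subtree via: %s" % (label, expr))
```

If the list of offenders is empty, every inspected expression stays
below its context node, which is the precondition described above
for discarding nodes once the directly called templates are done
with them. A real implementation would need a proper XPath parser
instead of regular expressions.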

J.Pietschmann

XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list

Current Thread

Re: [xsl] Max size?, (continued)
- Joseph Kesselman - Wed, 8 Jan 2003 13:02:54 -0500
- Edward . Middleton - Thu, 9 Jan 2003 09:57:44 +0900
  - J.Pietschmann - Thu, 09 Jan 2003 02:18:49 +0100
    - Michael Kay - Thu, 9 Jan 2003 13:11:22 -0000
    - J.Pietschmann - Thu, 09 Jan 2003 15:05:18 +0100 <=
    - Michael Kay - Thu, 9 Jan 2003 16:32:15 -0000
    - J.Pietschmann - Thu, 09 Jan 2003 20:26:45 +0100
    - Johannes Döbler - Thu, 09 Jan 2003 23:42:28 +0100
    - Michael Kay - Fri, 10 Jan 2003 09:41:51 -0000