Subject: Re: multiple input files to one output file
From: Steve Tinney <stinney@xxxxxxxxxxxxx>
Date: Wed, 02 Feb 2000 08:41:13 -0500

If you can use Saxon for your application, take a good long look at the
docs for both the extension facilities and the API.  For a large set of
data like this it will probably be worth writing a Java program which
caches the stylesheet and applies it to the list of files.
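
A rough sketch of the "compile once, apply many times" idea. It assumes
the standard javax.xml.transform (JAXP) interfaces rather than Saxon's
own API, and the class and method names (BatchTransform, transformAll)
are made up for illustration. The Templates object holds the compiled
stylesheet, so the parsing cost is paid once no matter how many files
are on the list.

```java
import java.io.Writer;
import java.nio.file.Path;
import java.util.List;
import javax.xml.transform.Source;
import javax.xml.transform.Templates;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper: not part of Saxon or of the original post.
final class BatchTransform {

    /**
     * Compiles the stylesheet once, then applies it to every input file,
     * appending each result to the same output writer.
     */
    static void transformAll(Source stylesheet, List<Path> inputs, Writer out)
            throws TransformerException {
        TransformerFactory factory = TransformerFactory.newInstance();
        // Templates is the reusable, compiled form of the stylesheet.
        Templates compiled = factory.newTemplates(stylesheet);
        for (Path input : inputs) {
            // A Transformer is cheap to create from cached Templates;
            // a fresh one per file keeps parameters and state separate.
            Transformer transformer = compiled.newTransformer();
            transformer.transform(new StreamSource(input.toFile()),
                                  new StreamResult(out));
        }
    }
}
```

Because each input is handed over as a file-based StreamSource, opening
and closing the file is left to the parser for that one transformation,
which should keep the number of open files from growing with the list.
If the stylesheet produces XML rather than text, each pass will emit its
own XML declaration unless omit-xml-declaration="yes" is set, so some
wrapping is needed to get one well-formed output document.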

You could either use Java's directory walking routines to generate the
file list, or have two stylesheets in your Java app.  The first could
process the XML files to extract the filenames/URLs; you could capture
the output of that internally, and then use it to drive the iteration of
the second sheet.
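
Both ways of getting the file list can be sketched roughly as follows.
Again this assumes the standard JAXP interfaces instead of Saxon's own
API, and every name here (FileListPipeline, extractFileNames,
walkDirectory, run) is hypothetical. It also assumes the first
stylesheet uses method="text" and writes one filename per line.

```java
import java.io.File;
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;
import javax.xml.transform.Source;
import javax.xml.transform.Templates;
import javax.xml.transform.TransformerException;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper: not part of Saxon or of the original post.
final class FileListPipeline {

    /** Option 1: walk a directory tree and collect the .xml files in it. */
    static List<String> walkDirectory(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths.filter(Files::isRegularFile)
                        .filter(p -> p.toString().endsWith(".xml"))
                        .map(Path::toString)
                        .sorted()
                        .collect(Collectors.toList());
        }
    }

    /**
     * Option 2: run the first stylesheet over the filenames document,
     * capture its text output in memory, and split it into lines.
     */
    static List<String> extractFileNames(Templates listingSheet, Source indexDoc)
            throws TransformerException {
        StringWriter captured = new StringWriter();
        listingSheet.newTransformer().transform(indexDoc, new StreamResult(captured));
        List<String> names = new ArrayList<>();
        for (String line : captured.toString().split("\\R")) {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) {
                names.add(trimmed);
            }
        }
        return names;
    }

    /** Uses the captured list to drive the second stylesheet, one file at a time. */
    static void run(Templates listingSheet, Templates mainSheet,
                    Source indexDoc, Writer out) throws TransformerException {
        for (String name : extractFileNames(listingSheet, indexDoc)) {
            mainSheet.newTransformer().transform(
                    new StreamSource(new File(name)), new StreamResult(out));
        }
    }
}
```

The directory walk is simpler if every XML file under a directory is
wanted; the two-sheet route makes more sense when the filenames
documents are the authority on what should be processed.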

 Steve

Eric van der Vlist wrote:
> Linda mentions "thousands of files"...
> It may be worthwhile to check that the files are closed and the
> resources released on each iteration of the for-each loop.
> Do you know if that's the case?
>
> Mike Brown wrote:
> >
> > > I do have XML files that specify the filenames and paths of all the
> > > documents in a specific subdirectory. Could I somehow read such a
> > > filenames-file and call the document function for each of the
> > > files listed, and then extract the information I want from each
> > > document?
> >
> > <xsl:for-each select="document('filenames_file.xml')/someURIs/file/text()">
> >   <xsl:variable name="current_file_root" select="document(string(.))"/>
> >   <!-- the next lines are just for example -->
> >   <xsl:text>&#xA;&#xA;current file: </xsl:text>
> >   <xsl:value-of select="."/>
> >   <xsl:text>&#xA;# of elements: </xsl:text>
> >   <xsl:value-of select="count($current_file_root//*)"/>
> > </xsl:for-each>
> >

-- 
----------------------------------------------------------------------
Steve Tinney                                        Babylonian Section
                                 *   University of Pennsylvania Museum
stinney@xxxxxxxxxxxxx                          Phila, PA. 215-898-4047


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

