Re: [xsl] An efficient XSLT program that searches a large XML document for all occurrences of a string?

Subject: Re: [xsl] An efficient XSLT program that searches a large XML document for all occurrences of a string?
From: "Liam R. E. Quin liam@xxxxxxxxxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Fri, 3 May 2024 01:05:31 -0000
On Thu, 2024-05-02 at 13:08 +0000, Roger L Costello costello@xxxxxxxxx
wrote:
>
> The input XML document is large, nearly 5GB.

You've probably considered this, but if you query the document more
than once, it may be worth putting it into e.g. BaseX, and using XQuery
instead of XSLT. The advantage of XQuery in a database is the index:
the database doesn't need to search the whole document to find text
nodes with values equal to DNKK for example.
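
To illustrate why the index helps, here is a small Python sketch. It is
not BaseX and not how BaseX implements its index; the sample document
and the names scan and build_text_index are made up for illustration.
It only shows the general idea: a full scan touches every node on every
query, while an index is built once and then answers each query with a
lookup.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Made-up sample data standing in for the large document.
SAMPLE = """<doc>
  <item><code>DNKK</code><note>first</note></item>
  <item><code>ABCD</code><note>second</note></item>
  <item><alt>DNKK</alt></item>
</doc>"""


def scan(root, target):
    """Full scan: visit every element and compare its leading text."""
    return [el for el in root.iter() if el.text == target]


def build_text_index(root):
    """Build a map from text value to the elements holding that text.

    Only el.text (the text before an element's first child) is indexed;
    tail text is ignored to keep the sketch short.
    """
    index = defaultdict(list)
    for el in root.iter():
        if el.text and el.text.strip():
            index[el.text].append(el)
    return index


root = ET.fromstring(SAMPLE)
index = build_text_index(root)   # cost paid once, up front
hits = index.get("DNKK", [])     # each later query is a dictionary lookup
print([el.tag for el in hits])
```

The trade-off is the usual one: building the index costs time and
space, so it only pays off if you query the document more than once.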

Of course, BaseX might fall over onto its back and wiggle its legs in
the air at a 5GByte XML document, I'm not sure. I've had more than 5G
in one BaseX database, I think, but not all in one file; it was the
collected works of Dorothy Parker.

liam

--
Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations: http://www.fromoldbooks.org
