Re: [xsl] Efficient way to do an identity transform, eliminating duplicate elements, in XSLT 1.0?

Subject: Re: [xsl] Efficient way to do an identity transform, eliminating duplicate elements, in XSLT 1.0?
From: Dimitre Novatchev <dnovatchev@xxxxxxxxx>
Date: Fri, 13 Dec 2013 10:14:53 -0800
So, how big is "big"?

My new home PC I bought in May has 32GB RAM.

It seems obvious that paying $2000 for a decent computer is so much
cheaper than paying programmers (myself included) to invent tricks ---
at least for this kind of problems.





On Fri, Dec 13, 2013 at 6:33 AM, Costello, Roger L. <costello@xxxxxxxxx>
wrote:
> Hi Folks,
>
> I need to do an identity transform on XML files like this:
>
> <Document>
>     <First>
>         <id>A</id>
>         <blah>B</blah>
>         <id>A</id>
>     </First>
>     <Second>
>         <id>C</id>
>         <blah>D</blah>
>         <id>C</id>
>     </Second>
> </Document>
>
> I want the identity transform to remove duplicate elements in <First> and
remove duplicate elements in <Second>. So the output should be:
>
> <Document>
>     <First>
>         <id>A</id>
>         <blah>B</blah>
>     </First>
>     <Second>
>         <id>C</id>
>         <blah>D</blah>
>     </Second>
> </Document>
>
> I need to use XSLT 1.0 to implement this.
>
> I created an implementation, but it uses <copy> statements. The actual XML
document that I am transforming is huge, nearly 1 GB. When I run my XSLT
implementation the processor runs out of memory. I think it's due to the
<copy> statements. I need a very efficient implementation. Any suggestions?
>
> /Roger
>



--
Cheers,
Dimitre Novatchev
---------------------------------------
Truly great madness cannot be achieved without significant intelligence.
---------------------------------------
To invent, you need a good imagination and a pile of junk
-------------------------------------
Never fight an inanimate object
-------------------------------------
To avoid situations in which you might make mistakes may be the
biggest mistake of all
------------------------------------
Quality means doing it right when no one is looking.
-------------------------------------
You've achieved success in your field when you don't know whether what
you're doing is work or play
-------------------------------------
To achieve the impossible dream, try going to sleep.
-------------------------------------
Facts do not cease to exist because they are ignored.
-------------------------------------
Typing monkeys will write all Shakespeare's works in 200yrs.Will they
write all patents, too? :)
-------------------------------------
I finally figured out the only reason to be alive is to enjoy it.

Current Thread