Re: [xsl] cleanup of <div>-elements

Subject: Re: [xsl] cleanup of <div>-elements
From: "Piez, Wendell A. (Fed) wendell.piez@xxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Mon, 27 Feb 2023 17:40:22 -0000
Hi,

I concur with what Mike Kay says indeed I think it's super important (a
phrasing I would not have used in 2003).

It's also important to note that projects such as XSpec make it possible to
unit test XSLT this way, when without XPec or something like it, unit testing
is dauntingly complex for a solo dev or small shop.

The shape of the problem overall, as Mike notes, changes when the inputs scale
up and out (there are lots and lots, and/or they are uncontrolled or poorly
controlled). What happens upstream and how much of it there is will matter a
lot - XSLT is great for managing this complexity but only if you can see it.

Cheers, Wendell



From: Michael Kay michaelkay90@xxxxxxxxx
<xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Sent: Monday, February 27, 2023 12:23 PM
To: xsl-list <xsl-list@xxxxxxxxxxxxxxxxxxxxxx>
Subject: Re: [xsl] cleanup of <div>-elements


In XSLT it is often easy to implement if it is easy to define.

Indeed, this applies to any language. But XSLT does have an advantage here,
which is that you can incrementally add rules as you discover more and more
oddities in the input. If you're dealing with a large number of input files
then the big problem becomes quality control: if you've got more documents
than you can "eyeball", you're going to need to do some automated checking
that you haven't introduced corruptions. It might be an idea to make this
exercise test-driven: start by defining the requirements in terms of
executable tests that the output must satisfy, and only then start writing the
code to actually do the transformations.

Michael Kay
Saxonica
XSL-List info and
archive<https://gcc02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.
mulberrytech.com%2Fxsl%2Fxsl-list&data=05%7C01%7Cwendell.piez%40nist.gov%7Cd3
c1820b2ca340aafd8308db18e73a68%7C2ab5d82fd8fa4797a93e054655c61dec%7C1%7C0%7C6
38131153661802407%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzI
iLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=PFtTXjaZEkJduv7uTpzTYHZ
iLkiQmrCKadQ8tKe5xGM%3D&reserved=0>
EasyUnsubscribe<https://gcc02.safelinks.protection.outlook.com/?url=http%3A%2
F%2Flists.mulberrytech.com%2Funsub%2Fxsl-list%2F3302254&data=05%7C01%7Cwendel
l.piez%40nist.gov%7Cd3c1820b2ca340aafd8308db18e73a68%7C2ab5d82fd8fa4797a93e05
4655c61dec%7C1%7C0%7C638131153661802407%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wL
jAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=7
%2By9WJIXxDErnuBvzx1t8xG%2BB0cuUKp%2BhCLPfu%2F51qY%3D&reserved=0> (by
email<>)

Current Thread