Re: html to xml

Subject: Re: html to xml
From: Sebastian Rahtz <sebastian.rahtz@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
Date: Fri, 27 Oct 2000 11:20:30 +0100
David Carlisle writes:
 > Surely grabbing floating PCDATA and sticking it in a paragraph element
 > is something easily done in the post tidy XSL transformation to TEI.

easily? take this:

 <h1>Intro</hi>
 I am the walrus
 <pre>Yes I am </pre>

 <p>A new paragraph


and insert a <p> around 

 I am the walrus
 <pre>Yes I am </pre>

....

 > Grabbing html section heads into TEI/docbook style section containers is
 > always a pain but you can do it in XSL with the usual "grouping"
 > techniques.
hmm, I hadnt thought of it as a group problem. you are right, tho,
thanks

 > It's made a bit easier if you know that the H? elements all
 > appear in "correct" sequence, not jumping from h1 to h3. If you use
 > ISO-HTML DTD then the SGML parser (eg sx ) will add any missing section
 > levels automagically if you set the appropriate parameter entity.

we live and learn. I didn't know that either. as if I had valid HTML
that sx would accept, of course!

sebastian


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread