RE: [xsl] extracting content from html

Subject: RE: [xsl] extracting content from html
From: "Aron Bock" <aronbock@xxxxxxxxxxx>
Date: Wed, 11 May 2005 15:02:43 +0000
Aaron, from my brief sojourn on this ng, it seems its participants act somewhat like the editors of the "weddings" section of the Sunday NY Times: oftentimes weddings that you'd be *sure* would make the section, don't, while obscure ones do. Likewise, sometimes people post what they assume are clear, descriptive questions and are summarily asked to elaborate, while others post what barely qualifies as a half-formulated question and get a torrent of responses, some even with generously made-up data :-)

I believe Heisenberg (he of the uncertainty principle) pushed through a 2 page thesis for his doctorate. The rest of us may need to be more expansive.

Thus, on the whole, it's inadvisable to rely on the vagaries of the collective ng consciousness, but rather, to provide sample input and sample wanted output. You'll then likely get much help from the many knowledgeable, helpful participants here.

Regards,

--A

From: Aaron Johnson <Aaron2.Johnson@xxxxxxxxx>
Reply-To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
Subject: [xsl] extracting content from html
Date: Wed, 11 May 2005 15:20:04 +0100


Hello...


I have an html document that I want to do jobs.

Firstly...

I'd like it to be a stand alone document that people can browse to.

Secondly...

I'd like to extract content from the document to populate a page in a
portal using xslt.

Being xhtml, the document counts as well formed xml so it should in
theory be possible...

...I have no idea where to start!

Can anyone make any suggestions please?

..................................................................
Aaron Johnson
GUI / XSLT development
Academic Technologies Group [ATG]
University of the West of England
ext: 81051
t: 0117 3281051
www: http://atg.uwe.ac.uk/aaron
e: aaron2.johnson@xxxxxxxxx
SECURITY POLICY:
Please note that ATG will only accept
e-mail attachments in the following
formats:
.asp,.bin,.doc,.gif,.html,.jpg,.mdb,.png,
.psd,.shtml,.sdf,.sit,.xls,.xml,.xsd,.xsl,.zip,
..................................................................
This communication is intended solely
for the use of the individual(s) to whom
it is addressed. Any opinions presented
are those of the author and do not
necessarily represent the University of
the West of England, Bristol.
..................................................................



This email has been independently scanned for viruses and any virus software has been removed using McAfee anti-virus software


_________________________________________________________________
Is your PC infected? Get a FREE online computer virus scan from McAfee. Security. http://clinic.mcafee.com/clinic/ibuy/campaign.asp?cid=3963


Current Thread