RE: [xsl] Converting HTML to plain text

Subject: RE: [xsl] Converting HTML to plain text
From: Wendell Piez <wapiez@xxxxxxxxxxxxxxxx>
Date: Tue, 22 Jun 2004 11:11:03 -0400
At 09:09 PM 6/21/2004, Jelmer wrote:
>Generally, XSLT can't be used on arbitrary HTML. A poor man's solution is
>to use a tool like HTML Tidy to make XML for XSLT from the HTML, but I
>don't know if that could be adapted to your requirement for "a platform
>independent way" (IIRC it is compiled for different platforms).

http://jtidy.sourceforge.net/

Also there's a hint on John Cowan's "Tag Soup" page that suggests that Saxon's parser includes his utility to handle HTML tag soup. Worth a look.


Cheers,
Wendell


====================================================================== Wendell Piez mailto:wapiez@xxxxxxxxxxxxxxxx Mulberry Technologies, Inc. http://www.mulberrytech.com 17 West Jefferson Street Direct Phone: 301/315-9635 Suite 207 Phone: 301/315-9631 Rockville, MD 20850 Fax: 301/315-8285 ---------------------------------------------------------------------- Mulberry Technologies: A Consultancy Specializing in SGML and XML ======================================================================


Current Thread