Re: [xsl] plea for help...

Subject: Re: [xsl] plea for help...
From: "Walter Torres" <walter@xxxxxxxxx>
Date: Thu, 9 Mar 2006 10:59:04 -0600 (CST)
On Wed, March 8, 2006 5:30 pm, Alexander Johannesen wrote:
> Hi,
>
>
> On 3/9/06, Walter Torres <walter@xxxxxxxxx> wrote:
>
>> Well, now that I've sold the idea, I'm stumped on how to do it.
>
> It depends quite dramatically what those HTML files contain. If
> they're reasonably semantic instead of presentation based, it should
> be easy. The other way is damn hard. Here's what I would do ;
>
> 1. Try various ways to convert the HTML files to either Wiki notation
> or OpenDocument formats; get rid of presentation markup and focus on the
> semantics. 2. Convert the files from the Wiki notation to clean XHTML, and
> use CSS for presentation.

I found "Tidy HTML"

It looks like it's handling the files I've been throwing at it pretty well.


>> Also, how do you deal with display characteristics in XML? Like BOLD
>> and ITALIC, or even BULLET LISTS (<ul><li>)
>>
>
> This again depends on what you're trying to achieve with this
> conversion. XHTML *is* XML, so you're already there. If you're doing
> something semantically outside the scope of HTML with the files, what
> are they?

I'm looking to set these files down to a very "basic" format so that I can
display them in almost any media:
 * web pages, with various looks
 * RSS feeds for other sites to pull the data
 * text for print media
 * WML for phones and hand-helds
 * etc


> If you just want to display them back again,

Yes I do, as I outlined above...


> just leave them as XHTML. I'm sure the interesting part of the articles
> start and stop at some points, so extract out these only.

If they are left as xHTML won't that effect how this file is handed by
various "devices"?

Or am I over-thinking this?

walter

Current Thread