Subject: Re: [xsl] resolve html entities From: David Carlisle <davidc@xxxxxxxxx> Date: Mon, 31 Oct 2005 09:48:43 GMT |
> 2.) get a fitting dtd/schema which maps these entities to unicode characters > > Would either one be a good starting point? It would have to be a dtd (schema's don't do entity definitions) This is the "standard" way of doing this so long as the "html" you are getting is well formed xml. But most html isn't even valid html never mind being well formed, in which case, as Michael said, using tag soup is a better option as it is designed to forgive at places where a browser would forgive (but an xml parser would give a fatal error).. David ________________________________________________________________________ This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [xsl] resolve html entities, Maximilian Gärber | Thread | Re: [xsl] resolve html entities, Maximilian Gärber |
Re: [xsl] Acces nodes [help], Ana Gaspar Martínez | Date | Re: [xsl] document() source, andrew welch |
Month |