Re: [xsl] text extraction

Subject: Re: [xsl] text extraction
From: David Carlisle <davidc@xxxxxxxxx>
Date: Thu, 12 Oct 2006 17:02:32 +0100
> Note that ISO-8859-1 is an order of magnitude smaller than UTF-8, so you 
> may end up with missing or replaced characters (not sure what they will 
> be replaced with though, when they don't exist) in the output stream.

XSLT will not drop or replace characters.

If the character is in a place where a character reference may be used,
it will be serialised as a character reference (&#1234;).
If it is in a place where a character reference cannot be used (e.g. an
element name, a comment, or anywhere in the text output method),
then that's an error. The processor may avoid these errors by instead
using UTF-8 or UTF-16 (which can encode all characters).
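
To make the three cases concrete, here is a rough analogy in Python rather than XSLT. It is only a sketch: Python's codec error handlers stand in for the serialiser, and the sample string and variable names are made up for illustration. No XSLT processor is involved.

```python
# Sketch only: Python codec error handlers standing in for an XSLT serialiser.
text = "price: 5 \u20ac"  # U+20AC (euro sign) does not exist in ISO-8859-1

# Case 1: a place where a character reference is allowed (e.g. element
# content in the xml output method). The character becomes a reference.
as_xml = text.encode("iso-8859-1", errors="xmlcharrefreplace")
print(as_xml)  # b'price: 5 &#8364;'

# Case 2: a place where no reference is possible (e.g. the text output
# method). Strict encoding fails instead of dropping the character.
try:
    text.encode("iso-8859-1")
    failed = False
except UnicodeEncodeError:
    failed = True
print(failed)  # True

# Case 3: switching to UTF-8 avoids the problem, since it can encode
# every character.
as_utf8 = text.encode("utf-8")
print(as_utf8)  # b'price: 5 \xe2\x82\xac'
```

In none of the three cases is the character silently lost or swapped for something else.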

David
