Subject: Re: [xsl] Validation XSLT using XSLT 1.0|
From: Michael Ludwig <mlu@xxxxxxxxxxxxx>
Date: Thu, 03 Jul 2008 10:33:41 +0200
Michael Ludwig wrote:
3. How to find non-ascii characters in the XML file and report an error using XSLT.
Don't use XSLT for this. Add the following XML declaration to your input documents:
<?xml version="1.0" encoding="us-ascii"?>
This will ensure the document won't get parsed unless it is pure ASCII.
Yet may still contain higher characters, which then will be escaped using numerical entity references... So, though the file will be US-ASCII, the contents does not necessarily fit in US-ASCII and still get parsed well.
That's true. I was lumping together the notions of character set and character encoding.
From the OP's specification, however, it's not entirely clear which one is the requirement here. Unless you suppose he'd have written "seven-bit clean bytes" instead of "non-ascii characters" had he wanted to talk about bytes instead of characters.