Re: [xsl] Validation XSLT using XSLT 1.0

Subject: Re: [xsl] Validation XSLT using XSLT 1.0
From: Michael Ludwig <mlu@xxxxxxxxxxxxx>
Date: Thu, 03 Jul 2008 10:33:41 +0200
Abel Braaksma schrieb:
Michael Ludwig wrote:

3. How to find non-ascii characters in the XML file and report an
error using XSLT.

Don't use XSLT for this. Add the following XML declaration to your input documents:

<?xml version="1.0" encoding="us-ascii"?>

This will ensure the document won't get parsed unless it is pure
ASCII.

Yet may still contain higher characters, which then will be escaped using numerical entity references... So, though the file will be US-ASCII, the contents does not necessarily fit in US-ASCII and still get parsed well.

That's true. I was lumping together the notions of character set and character encoding.

From the OP's specification, however, it's not entirely clear which one
is the requirement here. Unless you suppose he'd have written "seven-bit
clean bytes" instead of "non-ascii characters" had he wanted to talk
about bytes instead of characters.

Michael Ludwig

Current Thread