Re: [xsl] How to read the encoding of an XML document

Subject: Re: [xsl] How to read the encoding of an XML document
From: "Christopher R. Maden" <crism@xxxxxxxxx>
Date: Thu, 25 Oct 2001 13:33:49 -0700
At 12:59 25-10-2001, James Garriss wrote:
Ok. If you recall, I started this discussion by mentioning that I am receiving XML documents from several European countries. So the pertinent question for me is "if UTF-8 and/or UTF-16 will be the output encoding set I must use, will they handle charcters from the languages I care about?"

So it seems to me that I should be safe outputing my data to UTF-16. That make sense?

Yes. UTF-8 and UTF-16 both cover the entire Unicode repertoire. The difference is that that UTF-8 uses a different number of bytes for different characters, while UTF-16 uses 2 bytes for most characters. For European content, UTF-8 is usually a win; for Asian content, UTF-16 is generally better. But either can represent the entire Unicode repertoire.


-Chris
--
Christopher R. Maden, Principal Consultant, HMM Consulting Int'l, Inc.
DTDs/schemas - conversion - ebooks - publishing - Web - B2B - training
<URL: http://www.hmmci.com/ > <URL: http://crism.maden.org/consulting/ >
PGP Fingerprint: BBA6 4085 DED0 E176 D6D4  5DFC AC52 F825 AFEC 58DA


XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list



Current Thread