Subject: Re: Extended characters in XT From: James Clark <jjc@xxxxxxxxxx> Date: Tue, 01 Sep 1998 14:14:05 +0700 |
Output is always in UTF-8. Input is in UTF-16, UTF-8, US-ASCII or ISO-8859-1 according to the XML encoding declaration. Unfortunately there's a bug in XP's ISO-8859-1 support that stops ISO-8859-1 input working (patch at the end). If you have a problem with XT (or any other software), it's much easier to give help if you include a complete set of files that illustrate the problem you're having. *** C:/home/work/java/com/jclark/xml/tok/rdif00C9.aaa Tue Sep 1 14:13:12 1998 --- C:/home/work/java/com/jclark/xml/tok/ISO8859_1Encoding.java Fri Aug 28 17:07:36 1998 *************** *** 3,9 **** /** * An Encoding for ISO 8859-1. * ! * @version $Revision: 1.2 $ $Date: 1998/02/17 04:51:04 $ */ final class ISO8859_1Encoding extends Encoding { --- 3,9 ---- /** * An Encoding for ISO 8859-1. * ! * @version $Revision: 1.3 $ $Date: 1998/08/28 10:07:26 $ */ final class ISO8859_1Encoding extends Encoding { *************** *** 35,41 **** int initTargetStart = targetStart; int c; while (sourceStart != sourceEnd) ! targetBuf[targetStart++] = (char)sourceBuf[sourceStart++]; return targetStart - initTargetStart; } --- 35,41 ---- int initTargetStart = targetStart; int c; while (sourceStart != sourceEnd) ! targetBuf[targetStart++] = (char)(sourceBuf[sourceStart++] & 0xFF); return targetStart - initTargetStart; } Tomas Eriksson wrote: > > Hello, > > Does anyone know how XT handles extended character sets (above 127). I am > having trouble mapping entities to Unicode (both decimal and Hex). > > Regards > > Tomas > > XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Extended characters in XT, Tomas Eriksson | Thread | Re: Extended characters in XT, Tomas Eriksson |
Extended characters in XT, Tomas Eriksson | Date | Re: 2.6 patterns: let's try variati, Sean Mc grath |
Month |