Re: Extended characters in XT

Subject: Re: Extended characters in XT
From: James Clark <jjc@xxxxxxxxxx>
Date: Tue, 01 Sep 1998 14:14:05 +0700
Output is always in UTF-8.  Input is in UTF-16, UTF-8, US-ASCII or
ISO-8859-1 according to the XML encoding declaration.  Unfortunately
there's a bug in XP's ISO-8859-1 support that stops ISO-8859-1 input
working (patch at the end).

If you have a problem with XT (or any other software), it's much easier
to give help if you include a complete set of files that illustrate the
problem you're having.

*** C:/home/work/java/com/jclark/xml/tok/rdif00C9.aaa Tue Sep  1
14:13:12 1998
--- C:/home/work/java/com/jclark/xml/tok/ISO8859_1Encoding.java Fri Aug
28 17:07:36 1998
***************
*** 3,9 ****
  /**
   * An Encoding for ISO 8859-1.
   *
!  * @version $Revision: 1.2 $ $Date: 1998/02/17 04:51:04 $
   */
  final class ISO8859_1Encoding extends Encoding {
  
--- 3,9 ----
  /**
   * An Encoding for ISO 8859-1.
   *
!  * @version $Revision: 1.3 $ $Date: 1998/08/28 10:07:26 $
   */
  final class ISO8859_1Encoding extends Encoding {
  
***************
*** 35,41 ****
      int initTargetStart = targetStart;
      int c;
      while (sourceStart != sourceEnd) 
!       targetBuf[targetStart++] = (char)sourceBuf[sourceStart++];
      return targetStart - initTargetStart;
    }
  
--- 35,41 ----
      int initTargetStart = targetStart;
      int c;
      while (sourceStart != sourceEnd) 
!       targetBuf[targetStart++] = (char)(sourceBuf[sourceStart++] &
0xFF);
      return targetStart - initTargetStart;
    }
  


Tomas Eriksson wrote:
> 
> Hello,
> 
> Does anyone know how XT handles extended character sets (above 127). I am
> having trouble mapping entities to Unicode (both decimal and Hex).
> 
> Regards
> 
> Tomas
> 
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread