Subject: [xsl] Problem with encoding UTF-8 From: "David at roamware" <david@xxxxxxxxxxxxxx> Date: Wed, 15 Dec 2004 17:02:49 -0000 |
Hi, I have a collegue who is exporting a XML data file from a PDF (yes you can do that). He sent me the file, I ran it through my program using Xalan 2.5.1, xml-apis and xercesimpl.jar both from 31st July last year and it comes up with the very useless message "Premature end of file". Which I understand can mean enmcoding problems or XSD missing etc. If I take the file he sent me and in UltraEdit 32 use the UNICODE/UTF-8 -> UTF-8 conversion option save the file and then pop it through my program, all works fine. This ambiguous conversion is explained thus "This function will convert the complete file from Unicode or UTF-8 (ASCII representation) to UTF-8 (with the file internally as Unicode)" So I am at a bit of a loss to explain what the file format has to do with this, the PDF exports the file with the "encoding=UTF-8" in the xml element. Any experience of this behaviour and how to get around it? I cannot change what the PDF exports so it will have to be a "not strict" switch or something on the parser I suppose (couldn't find reference to such a thing mind you.). Thx. David Wynter
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [xsl] Space after <a> tag in te, David Carlisle | Thread | Re: [xsl] Problem with encoding UTF, Geert Josten |
Re: [xsl] Space after <a> tag in te, António Mota | Date | RE: [xsl] Space after <a> tag in te, Michael Kay |
Month |