Subject: [xsl] Pencil and Paper UTF-8
From: Gannon Dick <gannon_dick@xxxxxxxxx>
Date: Thu, 17 Jan 2013 07:51:31 -0800 (PST)
Also Re: Saxon 9.4 <bold></bold> Transformed to (newline)</bold> Problem

My suggestion would be to flatten newlines (etc.) to the Unicode "picture" (␤, I think). Other whitespace can be handled the same way. This may help [1].

Hyperlinks and "plain text" are not always what they seem once the search engines search and the lawyers lawyer, so you might want to handle the "link engineering" another way [2].

And in the real (American English, anyway) world you have *nix heads who never met a (possibly irreversible, undocumentable) shortcut they didn't like. Before you allow (lazy) people to flatten all diacritics out of plain text, try this XSLT [3]. You should download a copy and look at the source, because some of the C1 controls do not display in browsers, when in fact they have been hijacked by office suites for use as proprietary meta characters and need to be promoted/demoted to lose the proprietary meaning.

--Gannon

[1] http://www.rustprivacy.org/2013/egov/penserv/
[2] http://www.rustprivacy.org/2013/egov/gateway/
[3] http://www.rustprivacy.org/2013/weather/escapeHTMLent.xsl
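
For anyone who wants to try the newline-flattening idea outside a stylesheet, here is a minimal sketch in Python (not XSLT, so it can be run standalone). The function names are hypothetical and not taken from the post or the linked pages. It assumes ␤ (U+2424, SYMBOL FOR NEWLINE) for line breaks and ␉ (U+2409, SYMBOL FOR HORIZONTAL TABULATION) for tabs, and that the input does not already contain either of those two characters.

```python
# Hypothetical helpers illustrating "flatten whitespace to Unicode pictures".
# Assumption: the input text does not already contain U+2424 or U+2409.

NEWLINE_PICTURE = "\u2424"  # SYMBOL FOR NEWLINE
TAB_PICTURE = "\u2409"      # SYMBOL FOR HORIZONTAL TABULATION


def flatten_whitespace(text: str) -> str:
    """Replace line breaks and tabs with visible control pictures."""
    # Normalize CRLF and lone CR to LF first, so that every line break
    # becomes exactly one picture character.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    return text.translate({
        ord("\n"): NEWLINE_PICTURE,
        ord("\t"): TAB_PICTURE,
    })


def restore_whitespace(text: str) -> str:
    """Turn the control pictures back into real whitespace (LF and TAB)."""
    return text.translate({
        ord(NEWLINE_PICTURE): "\n",
        ord(TAB_PICTURE): "\t",
    })


if __name__ == "__main__":
    sample = "<bold>\r\n</bold>\tend"
    flat = flatten_whitespace(sample)
    print(flat)
    print(repr(restore_whitespace(flat)))
```

The round trip is only approximately reversible: the original CRLF versus LF distinction is lost, because all line breaks come back as LF.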