Re: [xsl] nbsp is not that hard, folks

Subject: Re: [xsl] nbsp is not that hard, folks
From: Mike Brown <mike@xxxxxxxx>
Date: Sat, 9 Nov 2002 06:48:33 -0700 (MST)
Américo Albuquerque wrote:
> So, what you are saying is that &nbsp; is to XML and HTML has "#define
> nbsp" is to C??

Sigh. No. Think of it like this:

In XSLT you may have <xsl:variable name="nbsp" select="'&#160;'"/>.
This is like <!ENTITY nbsp "&#160;"> in an XML DTD.

In XSLT you may have $nbsp in an XPath expression.
This is like &nbsp; in an XML or HTML document.


In XML you can define your own entities. By default, you only get 5:
quot, apos, amp, lt, gt. Each of these corresponds to just 1 character:
", ', &, <, and >, respectively.


In HTML you cannot define your own. But by default, you get about 252:

quot, amp, lt, gt, nbsp, iexcl, cent, pound, curren, yen, brvbar, sect, uml, 
copy, ordf, laquo, not, shy, reg, macr, deg, plusmn, sup2, sup3, acute, micro, 
para, middot, cedil, sup1, ordm, raquo, frac14, frac12, frac34, iquest, 
Agrave, Aacute, Acirc, Atilde, Auml, Aring, AElig, Ccedil, Egrave, Eacute, 
Ecirc, Euml, Igrave, Iacute, Icirc, Iuml, ETH, Ntilde, Ograve, Oacute, Ocirc, 
Otilde, Ouml, times, Oslash, Ugrave, Uacute, Ucirc, Uuml, Yacute, THORN, 
szlig, agrave, aacute, acirc, atilde, auml, aring, aelig, ccedil, egrave, 
eacute, ecirc, euml, igrave, iacute, icirc, iuml, eth, ntilde, ograve, oacute, 
ocirc, otilde, ouml, divide, oslash, ugrave, uacute, ucirc, uuml, yacute, 
thorn, yuml, fnof, Alpha, Beta, Gamma, Delta, Epsilon, Zeta, Eta, Theta, Iota, 
Kappa, Lambda, Mu, Nu, Xi, Omicron, Pi, Rho, Sigma, Tau, Upsilon, Phi, Chi, 
Psi, Omega, alpha, beta, gamma, delta, epsilon, zeta, eta, theta, iota, kappa, 
lambda, mu, nu, xi, omicron, pi, rho, sigmaf, sigma, tau, upsilon, phi, chi, 
psi, omega, thetasym, upsih, piv, bull, hellip, prime, Prime, oline, frasl, 
weierp, image, real, trade, alefsym, larr, uarr, rarr, darr, harr, crarr, 
lArr, uArr, rArr, dArr, hArr, forall, part, exist, empty, nabla, isin, notin, 
ni, prod, sum, minus, lowast, radic, prop, infin, ang, and, or, cap, cup, int, 
there4, sim, cong, asymp, ne, equiv, le, ge, sub, sup, nsub, sube, supe, 
oplus, otimes, perp, sdot, lceil, rceil, lfloor, rfloor, lang, rang, loz, 
spades, clubs, hearts, diams, OElig, oelig, Scaron, scaron, Yuml, circ, tidle, 
ensp, emsp, thinsp, zwnj, zwj, lrm, rlm, ndash, mdash, lsquo, rsquo, sbquo, 
ldquo, rdquo, bdquo, dagger, Dagger, permil, lsaquo, rsaquo, euro.

Each of these has the replacement text of just one character. That's why they
are called "character entities". They cover a variety of Unicode characters
all the way up to &#9830;.

   - Mike
____________________________________________________________________________
  mike j. brown                   |  xml/xslt: http://skew.org/xml/
  denver/boulder, colorado, usa   |  resume: http://skew.org/~mike/resume/

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread