Re: [xsl] page-number-citation

Subject: Re: [xsl] page-number-citation
From: Tokushige Kobayashi <koba@xxxxxxxxxxxxx>
Date: Wed, 30 Jan 2002 21:09:39 +0900
I think the problem arises when the same index term appears many times on a single page.

Index
index .. <fo:page-number-citation ref-id="1"/>,<fo:page-number-citation ref-id="2"/>,<fo:page-number-citation ref-id="3"/>,<fo:page-number-citation ref-id="4"/>
Body text

Page 100
<fo:block id="1">index</fo:block> ....
.....<fo:block id="2">index</fo:block>
............<fo:block id="3">index</fo:block>

Page 101
<fo:block id="4">index</fo:block> ....


In that case, the index comes out as:

index 100,100,100,101

which does not look nice. I believe that is what he is saying.
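
As a minimal sketch of the clean-up being asked for, the following drops repeated page numbers once they are known. It assumes the resolved numbers are already available as a list, which is exactly what XSLFO 1.0 does not provide at formatting time; the function name `unique_pages` is made up for illustration.

```python
def unique_pages(pages):
    """Return the page numbers in first-seen order with repeats dropped."""
    seen = set()
    result = []
    for page in pages:
        if page not in seen:
            seen.add(page)
            result.append(page)
    return result


# Page numbers resolved for ids 1 to 4 in the example above.
resolved = [100, 100, 100, 101]
print("index", ", ".join(str(p) for p in unique_pages(resolved)))
```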
Mr. Holman's solution is:

>Would someone confirm please, its not possible
>to generate an index of the form
>
><fo:block> index entry <fo:page-number-citation ref-id="abd"/>,
>         <fo:page-number-citation ref-id="abd"/>
></fo:block>
>
>Such that repeated page-numbers are not duplicated?

Yes, this is not possible in XSLFO 1.0.

>I have a number of 'entries' per page, and the generated index
>looks ..... not nice.

I'm working on indexing my electronic publications, and my methodology is 
becoming fairly extensive (which is why it isn't completed yet):

   (1) - format printable XML markup in the PDF, where the visible
         markup of attributes includes the formatted page
         numbers ... e.g.:
           &lt;entry page="<page-number-citation ref-id="abd"/>"&gt;...
   (2) - extract the text from the PDF of the index as a well-
         formed XML document ... e.g.:
           <entry page="12">...
   (3) - run a stylesheet that interprets the page numbers and
         does the reduction of the page ranges, including sorting,
         index levels, etc.
   (4) - format the page ranges into the formatted index
   (5) - delete the XML index from the PDF
   (6) - append the formatted index to the PDF

I've determined from my tests that the above methodology works ... it is 
just a matter of finding the time to do it all in my publishing environment.

I plan to include the methodology as an annex in a future edition of the 
XSLFO book.
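
Steps (2) and (3) of the methodology above could be sketched roughly as follows. This is only an illustration, not the stylesheet described in the message: it uses Python in place of a stylesheet, assumes the extracted index has an `<index>` wrapper with the term as the text of each `<entry>`, and assumes page numbers are plain Arabic numerals. The names `page_ranges`, `format_runs` and `build_index` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical extracted index. The <entry page="..."> shape follows the
# example in step (2); the <index> wrapper and the entry text are assumed.
EXTRACTED = """<index>
  <entry page="12">stylesheet</entry>
  <entry page="12">stylesheet</entry>
  <entry page="13">stylesheet</entry>
  <entry page="14">stylesheet</entry>
  <entry page="20">stylesheet</entry>
  <entry page="7">area tree</entry>
</index>"""


def page_ranges(pages):
    """Collapse page numbers into sorted (first, last) runs of consecutive pages."""
    runs = []
    for page in sorted(set(pages)):
        if runs and page == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], page)  # extend the current run
        else:
            runs.append((page, page))       # start a new run
    return runs


def format_runs(runs):
    """Render runs as '12-14, 20'; single pages are printed without a range."""
    return ", ".join(str(a) if a == b else f"{a}-{b}" for a, b in runs)


def build_index(xml_text):
    """Group pages by term, sort the terms, and reduce each page list."""
    pages_by_term = defaultdict(list)
    for entry in ET.fromstring(xml_text).iter("entry"):
        pages_by_term[entry.text.strip()].append(int(entry.get("page")))
    return [(term, format_runs(page_ranges(pages)))
            for term, pages in sorted(pages_by_term.items())]


for term, pages in build_index(EXTRACTED):
    print(term, pages)
```

Index levels and non-Arabic page formats (for example roman-numbered front matter) are left out of this sketch.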

---------------------------------------------------------
The next person appears to be someone working on the FOP implementation.

"W. Eliot Kimber" <eliot@xxxxxxxxxx> wrote:
> > At 2002-01-29 14:53 +0000, DPawson@xxxxxxxxxxx wrote:
> > >to generate an index ...
> > >Such that repeated page-numbers are not duplicated?
> Has anyone approached this problem using extension functions?

I don't know of a ready-to-use solution, but the need comes up
regularly. Look at
  http://marc.theaimsgroup.com/?l=fop-user&m=101232223805802&w=2
The idea was to place extension elements where indexed items
appear, which would write an XML file with page references
during normal PDF generation. This file would then be used to generate
the index using techniques already discussed here. Something
like the TeX approach.
Perhaps I'll get some time to actually implement a sample in FOP.

Regards
J.Pietschmann
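
The second pass of such a two-pass approach might look like the sketch below. The side-file format (`<refs>` containing `<ref term="..." page="..."/>`) is invented here for illustration and is not something FOP writes; the function name `index_blocks` is likewise hypothetical. Page numbers are kept as strings so that whatever format the first pass recorded is passed through unchanged.

```python
import xml.etree.ElementTree as ET

FO_NS = "http://www.w3.org/1999/XSL/Format"
ET.register_namespace("fo", FO_NS)

# Hypothetical side file written during the first formatting pass.
REFS = """<refs>
  <ref term="index" page="100"/>
  <ref term="index" page="100"/>
  <ref term="index" page="100"/>
  <ref term="index" page="101"/>
</refs>"""


def index_blocks(refs_xml):
    """Build one fo:block per term, listing each page number only once."""
    pages_by_term = {}
    for ref in ET.fromstring(refs_xml).iter("ref"):
        pages = pages_by_term.setdefault(ref.get("term"), [])
        page = ref.get("page")
        if page not in pages:  # skip repeats of the same page
            pages.append(page)
    blocks = []
    for term in sorted(pages_by_term):
        block = ET.Element(f"{{{FO_NS}}}block")
        block.text = f"{term} {', '.join(pages_by_term[term])}"
        blocks.append(block)
    return blocks


for block in index_blocks(REFS):
    print(ET.tostring(block, encoding="unicode"))
```

Because the index is then written with literal page numbers, the document has to be formatted a second time with the index included, and the index must not shift the pagination of the body.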



"G.> I hope this helps.
"G.> 
"G.> .......................... Ken
"G.> 
"G.> 
"G.> --
"G.> Upcoming: 3-days XSLT/XPath and/or 2-days XSLFO - Feb 18-22, 2002
"G.> 
"G.> G. Ken Holman                mailto:gkholman@xxxxxxxxxxxxxxxxxxxx
"G.> Crane Softwrights Ltd.         http://www.CraneSoftwrights.com/s/
"G.> Box 266, Kars, Ontario CANADA K0A-2E0 +1(613)489-0999 (Fax:-0995)
"G.> ISBN 0-13-065196-6                        Definitive XSLT & XPath
"G.> ISBN 1-894049-08-X  Practical Transformation Using XSLT and XPath
"G.> ISBN 1-894049-07-1               Practical Formatting Using XSLFO
"G.> XSL/XML/DSSSL/SGML/OmniMark services, books(electronic, printed),
"G.> articles, training(instructor-live,Internet-live,web/CD,licensed)
"G.> Next public training:  02-02-11,12,14,15,18,21,03-04,05,06,08,11,
"G.> -                                04-08,09,10,12,05-14,15,06-04,07
"G.> 
"G.> 
"G.>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
"G.> 
"G.> 


Tokushige Kobayashi
Antenna House, Inc.
E-mail koba@xxxxxxxxxxxxx
WWW    http://www.antenna.co.jp/
WWW    http://www.antennahouse.com/ (English)
TEL    +81-3-3234-1361(direct call)
FAX    +81-3-3221-9975

Antenna House XSL School

*February 1st, 2002, XSL School in Tokyo (Japanese)
http://www.antenna.co.jp/XML/school/xslday.htm



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

