Subject: Re: [jats-list] Translations for section titles From: "Gareth Oakes goakes@xxxxxxx" <jats-list-service@xxxxxxxxxxxxxxxxxxxxxx> Date: Fri, 6 May 2022 11:32:18 -0000 |
Hi all, I must admit to ignorance, having not attended the JATSCon presentations, but it seems a mistake to try and overload document elements with multi-lingual features. In simple cases Ibm sure it works out but more complicated cases may require each language to be in its own document. For example, an English document with index where the Mandarin equivalent has the index entries with an entirely different sort order. What webve done in complex cases is use unique IDs to identify relevant structural elements across the individual language variants. These IDs logically provide bsynchronisationb points in each document where we know the content will align. The IDs are unique within the source document but shared across the language variants. A trivial example would be a French document with 8 paragraphs and two headings with English equivalent having only 6 paragraphs plus two headings. You can align at each heading and give it a unique ID for bsyncb purposes. You can even ID the paragraphs (although two IDs are missing in the English example). Some XML legislations works this way, e.g. in the Canadian legal tradition that aligns English and French. Where the element overloading idea makes a lot of sense is when you have documents authored in a certain language but contain other language as embedded content. For example, a German technical paper may quote some English language content within. The bother languageb elements may act as either block or inline. This is speaking from general experience, Ibm not sure how much directly relates to JATS, I guess at the end of the day it depends on what you want to do with the multi-lingual data anyway. I hope some if the ideas are helpful for the purposes of this discussion. // Gareth Oakes // Chief Project Officer, GPSL // www.gpsl.co From: "Nikos Markantonatos nikos@xxxxxxxxxx" <jats-list-service@xxxxxxxxxxxxxxxxxxxxxx> Reply to: "jats-list@xxxxxxxxxxxxxxxxxxxxxx" <jats-list@xxxxxxxxxxxxxxxxxxxxxx> Date: Friday, 6 May 2022 at 17:48 To: "jats-list@xxxxxxxxxxxxxxxxxxxxxx" <jats-list@xxxxxxxxxxxxxxxxxxxxxx> Subject: Re: [jats-list] Translations for section titles Hi Gerrit. Thanks for sharing this illustrative example. I personally find the collection of all languages under the same <title> element by means of several <name-content> elements semantically inelegant and simply unacceptable. I suspect that the multi-lingual approach described by Vincent in JATSCon will be accompanied by a corresponding relaxing of several models in JATS to allow for "zero or more" instances rather than the stricter "zero or one" logic the models allow for today. So <label> and <title> under <sec> will have to allow for "zero or more" instances. They only allow for "zero or one" today. Same goes for <label> and <title> under <list>. A similar logic must be applied to <label> and <caption> under <table-wrap>. And the list goes on and on. Vincent, I wonder whether there has been any thought on how many models in JATS will need to be relaxed to allow for true side-by-side multi-lingual encoding. Nikos On 5/4/22 9:42 PM, Imsieke, Gerrit, le-tex gerrit.imsieke@xxxxxxxxx<mailto:gerrit.imsieke@xxxxxxxxx> wrote: Hi Vincent (and list), Yesterday after your presentation and during the JATS-Con social (or happy) hour I complained that if you have a multiple-language document in which you choose to tag your content with block-level items and their translations side-by-side, you can't have translations for section titles. I was thinking of someone who followed the "content items in two or more languages" approach (https://www.ncbi.nlm.nih.gov/books/NBK579699/#lizzi-content-items-in-two-or- more-languages) by adding a translation for each figure, table-wrap, p etc. element that is allowed within a section, side by side. While you can have multiple figures, tables, paragraphs etc. in a row, you can't provide translation elements for elements that are allowed exactly once in a section, in particular, for the title element. But there's a workaround: You can have the title translations as inline content in the title element, like so: <sec id="sec1"> <title><named-content content-type="title-content" lang-group="sec1-content" xml:lang="en">English Title</named-content> <named-content content-type="title-content" lang-group="sec1-content" xml:lang="fr">Titre franC'ais</named-content> <named-content content-type="title-content" lang-group="sec1-content" xml:lang="de">Deutscher Titel</named-content></title> <p lang-group="sec1-p1" xml:lang="en">Paragraph</p> <p lang-group="sec1-p1" xml:lang="fr">Paragraphe</p> <p lang-group="sec1-p1" xml:lang="de">Absatz</p> </sec> So I think it's feasible by and large to pursue a block-level translation approach, with minor sacrifices where you need to process inline translations. (Note: I'm not advocating such an approach, but it came up occasionally when discussing how to tag multilingual content with customers.) Gerrit -- [Image removed by sender.]<https://www.atypon.com/> Nikos Markantonatos | Atypon, Greece Operations Head Leoforos Ethnikis Antistaseos 39A, 3rd floor, Nea Ionia, 14234, Greece office +302110133003 | mobile +306974302945 | nikos@xxxxxxxxxx<mailto:nikos@xxxxxxxxxx> [Image removed by sender.]atyponb.com CONFIDENTIAL: This email and any attachments may contain confidential and legally privileged information for the exclusive use of the designated recipients. Unauthorized review, use, storage, disclosure or distribution is prohibited. If you are not the intended recipient, contact the sender and destroy all copies of the original message. JATS-List info and archive<http://www.mulberrytech.com/JATS/JATS-List/> EasyUnsubscribe<http://lists.mulberrytech.com/unsub/jats-list/2708257> (by email<>)
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
Re: [jats-list] Translations for se, Nikos Markantonatos | Thread | Re: [jats-list] Translations for se, Lizzi, Vincent vince |
Re: [jats-list] Translations for se, Nikos Markantonatos | Date | Re: [jats-list] Translations for se, B Tommie Usdin btusd |
Month |