Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Japanese bibliography rendering style #52

Open
ronaldtse opened this issue Aug 22, 2024 · 40 comments
Open

Implement Japanese bibliography rendering style #52

ronaldtse opened this issue Aug 22, 2024 · 40 comments
Assignees
Labels
enhancement New feature or request

Comments

@ronaldtse
Copy link

In relaton/relaton-plateau#15 the PLATEAU pubid has removed the Japanese characters for edition.

For JIS and PLATEAU, we need to implement the Japanese bibliography rendering style. The style referenced by JIS Z 8301:2019 is JIS X 0807 based on ISO 690.

Screenshot 2024-08-22 at 5 30 17 PM
@ronaldtse ronaldtse added the enhancement New feature or request label Aug 22, 2024
@opoudjis
Copy link
Contributor

Right now the JIS biblio style does exist, it's just severely underspecified. relaton-render will need updates to deal with things like ordinals for editions with a number in the middle. But I need real information about what is expected, in English, and not in flyspeck Kanji screengrab.

@opoudjis opoudjis self-assigned this Aug 22, 2024
@opoudjis opoudjis moved this to 🆕 New in Metanorma Aug 22, 2024
@opoudjis
Copy link
Contributor

JIS X 0807 is not a stylesheet, but an ontology, and an unadventurous copy-paste of ISO 690-2:1997, without any specialisation for Japanese. It is absurd to say that this specifies a Japanese bibliography style at all: it is the ISO 690 default, with English-only examples (!!!!!!), and as such, it is already supported in relaton-render by default. At most, we may need to revert from ISO house styling to default ISO 690 styling; but if I'm going to do so, I won't on the basis of this spec.

JIS Z 8301:2019 may have usable information for us. JIS X 0807 does not.

This is the information out of it—all of it stuff we've already done.

Electronic books, databases and computer programs

  • Example 1. CAROLL, Lewis. Alice's Adventures in Wonderland [online]. Texinfo ed.2.1. [Dortmund, Ger-many] : WindSpiel, November 1994 [cited 10 February 1995]. Available from World Wide Web : <http : //www. germany. eu. net/books/caroll/alice. html>. Also available in PostScript and ASCII versions from Internet: <ftp//ftp. Germany. EU. net/pub/books/carroll/>.
  • Example 2. Meeting agenda [online]. Gif-sur−Yvette (France) : Center dʼEtudes Nuclaires/Saclay, Service de Documentation, March 1991- [cited 30 September 1992]. Updated bimonthly. ASCII format. Available from QUESTEL.
  • Example 3. Kirk-Othmer Encyclopedia of Chemical Technology [online]. 3rd ed. New York: John Wiley, 1984 [cited 3 January 1990]. Available from DIALOG Information Services, Palo Alto (Calif.).
  • Example 4. AXWORTHY, Glenn. Where in the World is Carmen Sandiego? [disk]. Version for IBM/Tandy. San Rafael (Calif.) : Broderbund Software, 1985. 1 computer disk; 5 1/4 in. Accompanied by : 1986 World Almanac and Book of Facts. System requirements: IBM/Tandy compatibles; 128kB RAM; MS DOS 2.0, 3.0 series; graphics adapter required. Designers: Gene Portwood and Lauren Elliott.

A part of an electronic book, a part of a database, or a part of a computer program

  • Example 1. World Factbook [CD-ROM]. [Washington, D. C.] : Central Intelligence Agency, 1990. Spain vital statistics. ID number : CI WOFACT 1206.
  • Example 2. ICC British Company Financial Datasheets [online]. Hampton (Middlesex, U. K.) : ICC Online, 1992, updated 3 March 1992 [cited 11 March 1992]. Robert Maxwell Group PLC. Accession no. 01209277. Available from : DIALOG Information Services, Palo Alto (Calif.).
  • Example 3. CARROLL, Lewis. Alice's Adventures in Wonderland [online]. Texinfo. ed. 2. 2. [Dortmund, Germany] : WindSpiel, November 1994 [cited 30 March 1995]. Chapter VII. A Mad Tea−Party. Available from World−Wide Web : <http : //www. germany. eu. net/books/carroll/alice 10. html#SEC13>.

An article of an electronic book, an article of a database or an article of a computer program

  • Example 1. ZHUKOVSKY, Vladimir ; ITKIN, Vladimir ; and CHERNENKO, Lev. Helicopters over the crater. In Current Digest of the Soviet Press [online]. Columbus (Ohio) : Current Digest of the Soviet Press, 11 June 1986 [cited 14 February 1991]. Accession no. 0008752. Available from DIALOG Information Services, Palo Alto (Calif.).
  • Example 2. Belle de Jour. In Magill’s Survey of Cinema [online]. Pasadena (Calif.) : Salem Press, 1985-[cited] 1994-08-04]. Accession no.0050053. Available from: DIALOG Information Services, Palo Alto (Calif.).
  • Example 3. MCCONNELL, W. H. Constitutional History. In The Canadian Encyclopedia [CD-ROM]. Macintosh version 1. 1. Toronto: McClelland & Stewart, c1993. ISBN 0-7710-1932-7.

Electronic periodicals

  • Example 1. Journal of Technology Education [online]. Blacksburg (Va.) : Virginia Polytechnic Institute and State University, 1989- [cited 15 March 1995]. Semi-annual. Available from Internet : <gopher : //borg. lib. vt. edu : 70/1/jte>. ISSN1045-1064.
  • Example 2. Profile Canada [CD-ROM]. Toronto: Micromedia, 1993-. The Canadian Connection. Accompanied by : userʼs guide. System requirements : IBM PC or compatible; MPC Standard CD-ROM drive; DOS 3.30 or higher; 490 kB RAM; MS-DOS Extensions 2. 1 or higher. Quarterly.

Serial articles and other articles

  • Example 1. STONE, Nan. The Globalization of Europe. Harvard Business Review [online]. May−June 1989 [cited 3 September 1990]. Available from BRS Information Technologies, McLean (Va.).
  • Example 2. PRICE−WILKIN, John. Using the World−Wide Web to Deliver Complex Electronic Documents : Implications for Libraries. The Public-Access Computer Systems Review [online]. 1994, vol. 5, no. 3 [cited 1994-07-28], pp. 5-21. Available from Internet: <gopher://info. lib.uh.edu:70/00/articles/e-journals/uhlibrary/pacsreview/v5/n3/pricewil.5n3>. ISSN1048-6542

Electronic bulletin boards, e-mail conferences and e-mail systems

  • Example 1. PACS−L (Public Access Computer Systems Forum) [online]. Houston (Tex.) : University of Houston Libraries, June 1989- [cited 17 May 1995]. Available from Internet: listserv@uhupvml. uh.edu.
  • Example 2. Federal Depository Library Program Files (/GO Depository) [online]. Washington, D. C.: Government Printing Office [cited 1995-02-15]. Available from Internet : <telnet://federal. bbs.gpo.gov>.

Email

  • Example 1. PARKER, Elliott. Re: Citing Electronic Journals. In PACS−L (Public Access Computer Systems Forum) [online]. Houston (Tex.): University of Houston Libraries, 24 November 1989; 13: 29: 35 CST [cited 1 January 1995; 16: 15 EST]. Available from Internet: <telnet://[email protected]. org>.
  • Example 2. PRITCHARD, Sara. Your Request for Information about ISO Standards. [online]. Message to Margaret MORRISON. 18 February 1995 [cited 3 March 1995]. Personal communication.

@opoudjis
Copy link
Contributor

Next task, since this was not useful: look at the styling of JIS Z 8301:2019.

@opoudjis
Copy link
Contributor

For starters, the bizarre requirement to underline (dotted underline, no less) citations where the JIS standard has been updated from the canonical ISO adoption standard, or is other than the canonical ISO adoption standard, or is the canonical ISO adoption standard ignored in favour of the original ISO standard — will be ignored. That is some sort of semantic annotation of JIS references that will need to be worked out and resolved if someone ever asks for this.

... What... a strange thing to do.


JIS Z 8301:2019 is no more help than JIS X 0807: clause 21.5's examples of citations are also English, in their entirety.

Do Japanese standards never cite anything in Japanese?

What I need is a House Style; I need the Japanese counterpart to the Chicago Manual of Style, I need an overview of relevant Japanese grammar, and I need someone who speaks Japanese to do QA. Which I presume is @ReesePlews.

The query particularly that was posed up top, that we don't have "5th edn" included in Plateau identifiers any more, is not going to be addressed in JIS Z 8301: (1) edition numbers are NOT included in ISO or JIS standards under ISO or JIS; and (2) JIS Z 8301 is not where I'm going to find out that the Japanese for "5th edn." is 第5版 (or の第 5 版, I guess, in "X,
5th edn." Especially when all their examples are in English anyway.

Never mind that I'm having to guess what is going on via Google Translate. This is not the professional way of going about this.

This is useless. The useful thing to do is:

  1. I implement 第%版 in relaton-render (right now it's English "{{ var1 | ordinal_num: 'edition', '' }} edition", because I had no idea what it was.
  2. I introduce editions into Plateau (not JIS) rendering of standards.
  3. I generate a bunch of (preferably Japanese language) references in a range of bibliographic types, with the existing ISO-inherited stylesheet, and Reese can tell me what looks wrong.

@opoudjis
Copy link
Contributor

opoudjis commented Aug 30, 2024

FWIW, the default rendering of a monograph that I test on in relaton-render,

RAMSEY, J. K. and W. C. MCGREW. Object play in great apes: Studies in nature and captivity. In: PELLEGRINI, Anthony D. and Peter Kenneth SMITH (eds.): The nature of play: Great apes and humans [electronic resource, 8vo]. 3rd edition. New York, NY: Guilford Press. 2005. pp. 89–112. https://eprints.soton.ac.uk/338791/. [viewed: September 3, 2019].

is now:

RAMSEY, J. K. と W. C. MCGREW. Object play in great apes: Studies in nature and captivity. PELLEGRINI, Anthony D. と Peter Kenneth SMITH (編): The nature of play: Great apes and humans [electronic resource, 8vo]. 第3版. New York, NY: Guilford Press. 2005. ページ89–112. https://eprints.soton.ac.uk/338791/. [見た: 2019年9月3日].

@ReesePlews how wrong is that?

opoudjis added a commit that referenced this issue Aug 30, 2024
@opoudjis
Copy link
Contributor

The JIS representation of standards right now is:

standard: "{% if home_standard %}<span_class='stddocTitle'>{{ title }}</span> ,_{{ extent }}{% else %}{{ creatornames }}. <span_class='stddocTitle'>{{ title }}</span> ,_{{ extent }} .  {{ labels['version'] | capitalize }}_{{ edition_raw }}. {{labels['updated'] | capitalize }}_{{date_updated}}. {{status | capitalize}}. {{ authorizer }}. {{ labels['availablefrom'] }}:_<span_class='biburl'>{{ uri }}</span>.{% endif %}"

To spell out:

JIS, ISO: TITLE, PART (optional)

Other Standards: AUTHORS. TITLE, PART (optional). Version EDITION. Updated UPDATE-DATE. STATUS. SPONSOR. Available from: URI.

Version, Updated, Available from can be in Japanese.

So, with Japanese i18n, IETF RFC 3979 is displayed as:

S. Bradner. Intellectual Property Rights in IETF Technology. RFC Series. 入手可能: https://www.rfc-editor.org/info/rfc3979

What I'm getting here is a suggestion that Plateau documents, in Plateau, have optional edition numbers. So presumably,

JIS, ISO, Plateau: TITLE EDITION PART (optional)

Where EDITION will be the newly introduced 第5版 in Japanese, and "5th edn." in English.

So, "PLATEAU Handbook #2 1.0" right now (as a document other than JIS and ISO) renders as:

PLATEAU Handbook #2 1.0, 3D都市モデル標準作業手順書. Version 1.0. 国土交通省. 入手可能: https://www.mlit.go.jp/plateau/file/libraries/doc/plateau_doc_0002_ver01.pdf.

I'm assuming what you want, @ronaldtse , in your cryptically phrased ticket, is

PLATEAU Handbook #2 1.0, 3D都市モデル標準作業手順書. 第1.0版. 国土交通省. 入手可能: https://www.mlit.go.jp/plateau/file/libraries/doc/plateau_doc_0002_ver01.pdf.

Is that what you want? Do you want just the title as with ISO/JIS, so

PLATEAU Handbook #2 1.0, 3D都市モデル標準作業手順書. 第1.0版.

... or what?

@opoudjis
Copy link
Contributor

Italics are still happening in titles, and they shouldn't be.

@opoudjis
Copy link
Contributor

Needed to tell Plateau explicitly to use JIS relaton-render class.

opoudjis added a commit to metanorma/metanorma-plateau that referenced this issue Aug 30, 2024
@ronaldtse
Copy link
Author

@ReesePlews will also supplement with some guidelines for Japanese bibliography.

@ronaldtse
Copy link
Author

ronaldtse commented Aug 31, 2024

@opoudjis indeed, JIS X 0807 is an adoption of the old ISO 690-2, and doesn't seem to provide Japanese bibliography rules.

I extracted two relevant examples from the JIS Z 8301 bibliography. I hope this is sufficient for our implementation.

Example 1

現代仮名遣い 昭和 61.7.1 内閣告示第 1 号
 入手先 [オンライン 2018.07.12 閲覧]:
 http://www.mext.go.jp/b_menu/hakusho/nc/k19860701001/k19860701001.html

This means:

{title: 現代仮名遣い} {date: 昭和 61.7.1} {subtitle: 内閣告示第 1 号}
 {source: 入手先} [{online: オンライン} {2018-07-12: 2018.07.12} {read: 閲覧}]:
 http://www.mext.go.jp/b_menu/hakusho/nc/k19860701001/k19860701001.html

The symbol is entered by alt + square bracket on the Japanese keyboard.

Example 2

文部科学省用字用語例 平成 23.3
 入手先:新訂 公用文の書き表し方の基準(資料集),文化庁編集,第一法規,2011,pp. 313-346.
{article title: 文部科学省用字用語例} {date: 平成 23.3}
 {source: 入手先}: {book title: 新訂 公用文の書き表し方の基準(資料集)},{collection title: 文化庁編集},{publisher: 第一法規},{date of publication: 2011},{page numbers: pp. 313-346}.

@ReesePlews
Copy link

ReesePlews commented Aug 31, 2024

hello @opoudjis thanks for examining this issue. i know bibliographic research takes places here but i dont know what documents exist that describe these. i have not seen a "Japanese like - Manual of Style" or similar document. perhaps the National Diet Library could provide some references. we could also ask JSA what the status is of any updates for JIS X0807 harmonizing with ISO 690:2021.

for our work here, i did some searching and found that the Japan Science and Technology Agency (JST) had a project called SIST from the mid-2000s thru mid-2010s that was tasked to develop "The Standards for Information for Science and Technology (SIST) are standards designed to facilitate the distribution of scientific and technical information. the project has concluded and the website is archived on the government archive server (WARP). From the WARP server we can access all of the "standard" documents and some other materials.

SIST 02 along with a PDF for download describes the preparation of bibliographic/citation references. i also happened to find this github site but the links were based on the original site (no the WARP site now), however they could be searched using the SIST 02 html manual pages. the github site also had some english information.

from the SIST 02 document we can pickup some styling guidelines.

regarding the earlier examples that you have prepared here are some comments (based on SIST 02)

[見た: 2019年9月3日] in your example should be changed to [参照: 2019年9月3日] (even though you may receive a translation of "see also" or "reference" it is the correct notation for "accessed or viewed". 見た comes up as a correct translation of "viewed" but it is not correct for written documents.

some of your examples use 入手可能 which can mean "available". that japanese could be changed to something from SIST02 such 入手日付 (date of access) .

use of " ページ" is not used for these type of references. "p." or "pp." are well understood.

use of "と" for "and" in a list of authors... the meaning is correct but i dont think i have seen that used in a list of authors, english or japanese. perhaps the "と" can be left out but if the English model requires that "and" is used in a list of authors we could keep it in and ask the Plateau Team members.

use of "編" for "Editor, Ed." is acceptable. it is often used in Japanese references.

use of " 第5版" for "5th edition" or something similar is a correct translation and it would be used in Japanese references. i think it could be kept here, unless it is difficult to manage.

i see @ronaldtse has provided some new information while i have been preparing this answer.
after you have it ready, i will ask the Plateau Team for feedback. thank you.

@ReesePlews
Copy link

@ronaldtse and i were checking the 文化庁 site a while back and i thought they had some guidelines on the creation of "written documents". i could not find that link today. Ron do you remember that?

@ronaldtse
Copy link
Author

The guidelines for official documents from the Bunka-cho does not specify anything about the bibliography, and itself uses QR codes and links for references 😓

it is here: https://www.bunka.go.jp/seisaku/bunkashingikai/kokugo/hokoku/pdf/93651301_01.pdf

@ronaldtse
Copy link
Author

@ReesePlews I checked SIST 02. It does contain many bibliographic reference examples, but they are mostly using ISO 690 style bibliographic presentation with English half-width punctuation. The guidelines actually cite the JIS adoption of ISO 690 and ISO 690 itself.

@opoudjis and I did include some Japanese bibliographic practices in the latest ISO 690 as examples, but they are insufficient. So there is a future opportunity to do so...

@opoudjis
Copy link
Contributor

opoudjis commented Sep 1, 2024

Punctuation should be i18n'd in the output, but it looks like this is only being done for Chinese, not Japanese. Punctuation i18n involves full-width equivalences for Roman punctuation.

This is what we do for Chinese. Can I confirm that the same needs to be done for Japanese, or should I do different substitutions? Left-hand Roman, right-hand CJK.

: :
, ,
. .
) )
] ]
: :
; ;
? ?
! !
– ~ # en-dash
( (
[ [

@opoudjis
Copy link
Contributor

opoudjis commented Sep 1, 2024

Removing と for "and"; using "," instead (so as to keep the author_and templates common with JA)

Using "p, pp" instead of ページ

The date accessed is given as 見た currently. I am changing it to 参照

@opoudjis
Copy link
Contributor

opoudjis commented Sep 1, 2024

@opoudjis indeed, JIS X 0807 is an adoption of the old ISO 690-2, and doesn't seem to provide Japanese bibliography rules.

I extracted two relevant examples from the JIS Z 8301 bibliography. I hope this is sufficient for our implementation.

Example 1

現代仮名遣い 昭和 61.7.1 内閣告示第 1 号
 入手先 [オンライン 2018.07.12 閲覧]:
 http://www.mext.go.jp/b_menu/hakusho/nc/k19860701001/k19860701001.html

This means:

{title: 現代仮名遣い} {date: 昭和 61.7.1} {subtitle: 内閣告示第 1 号}
 {source: 入手先} [{online: オンライン} {2018-07-12: 2018.07.12} {read: 閲覧}]:
 http://www.mext.go.jp/b_menu/hakusho/nc/k19860701001/k19860701001.html

using Google Translate:

Modern Kana Spelling, July 1, 1986, Cabinet Notification No. 1. Available from [online, accessed July 12, 2018]:

So: { title } { date } [and I'm not converting to regnal years, that should be done in the source] { series } Available from [online {date-accessed} read : url ]

.... And this gives me the answer to the question I had, what is "available from": 入手先 ... 閲覧

This means I have to change

_{{ labels['availablefrom'] }}:_<span_class='biburl'>{{ uri }}</span>. [{{ labels['viewed'] }}:_{{date_accessed}}].

to

{% if uri %}{{ labels['availablefrom'] }} [{{ labels['online'] }} {% if date_accessed %}{{ viewed }}{% endif %} {{ uri }}]{% endif %}

Japanese 

availablefrom = 入手先
online = オンライン
viewed = {{ var1 }} 閲覧 # populated in code with date_accessed

English

availablefrom = Available from
online = online
viewed = viewed {{ var1 }} 

Available from [online {date-accessed} read : url ]

The symbol is entered by alt + square bracket on the Japanese keyboard.

Query on punctuation localisation posted above.

Example 2

文部科学省用字用語例 平成 23.3
 入手先:新訂 公用文の書き表し方の基準(資料集),文化庁編集,第一法規,2011,pp. 313-346.
{article title: 文部科学省用字用語例} {date: 平成 23.3}
 {source: 入手先}: {book title: 新訂 公用文の書き表し方の基準(資料集)},{collection title: 文化庁編集},{publisher: 第一法規},{date of publication: 2011},{page numbers: pp. 313-346}.

Ministry of Education, Culture, Sports, Science and Technology, Examples of Characters and Terminology, March 2011. Available from: Revised Standards for Writing Official Documents (Collection of Materials), edited by the Agency for Cultural Affairs, Dai-Ichi Hoki, 2011, pp. 313-346.

So 入手先 is "in" presumably for a book chapter.

From these examples, there is no full stop between title, date, and series, no putting series in brackets, and there is a comma between publisher and page numbers.

These examples are confusing. Why is the date repeated in Example 2? If 文化庁編集 is a series, why is "Edited by the Agency for Cultural Affairs"? Is that an editor indication instead?

I am going to get something done as best as I can make sense of this information, and if this doesn't produce acceptable output, I am going to require precise explanation as to why. These are programmatically generated references, and they need programmatic structure.

@ReesePlews
Copy link

hello @opoudjis thank you for the additional investigation on this. i will send these samples to the client and see if i can get some feedback today. the National Diet Library has these pages on "bibliographic data", the translation and browsing works well with chrome. along with other information on digital library projects. additionally some translations of various metadata documents and such. there is a "standardization" contact address there at the bottom of the page. perhaps @ronaldtse, as one of the editors for ISO 690, can reach out to them and see if they know of open information or an actual Japanese style reference for bibliographic entries.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 2, 2024

This is going to miss the release, which is today, but I don't think this is an hour's work anyway.

What I would particularly like is examples not of report citations, which are odd in how they are presented because of how ad hoc their publication and metadata is; but of more established formats, that I can extrapolate from: books, book chapters, journal articles.

I am concerned that there is idiosyncrasy in the Japanese bibliographies: we'll add a day-month-date here but not there, a regnal year here but a Gregorian date there, we will (maybe?) conflate publishers and editors. Reports are rife with that kind of uncertainty, but I will not be implementing uncertainty. These references are generated by template: they will not be hand crafted. So I really do need prototypical, not atypical references, to work out what the structure is. (And journal articles are more prototypical than reports.)

I will sift through what you've sent after I do the release, but... I need consolidated and straightforward advice on bibliographic formatting, preferably with exemplars, preferably not involving reports, and preferably from the client. I'm sure there is a house style at the client, particularly around the use of punctuation in references.

I do also need confirmation on full-width punctuation for Japanese @ronaldtse @ReesePlews , see #52 (comment)

@ReesePlews
Copy link

i think it is a difficult call to make with the double-byte punctuation. we would never want to use a double byte "comma" between non-japanese authors, even if that is japanese bib entry. we may find a kanji comma "、" but i think that would be unbalanced too. the kanji comma "、" is more used in the actual body text of the document. since the bib entries are not considered "body text" i think they would be using single byte punctuation; but that is just my comment.

what about the SIST 02 document? i think we may have better luck finding style guides for scientific publications more than government documents. from my limited experience with government report preparation, the main emphasis is on the type of sections included, rather than the style. in my experiences, the end users are strict about having specific "common named" sections so the documents are easy to navigate rather than worrying so much about the styles... they will advise on fonts, style of non japanese words (single instead of double byte), etc.

SIST 02 along with a PDF for download describes the preparation of bibliographic/citation references. i also happened to find this github site but the links were based on the original site (no the WARP site now), however they could be searched using the SIST 02 html manual pages. the github site also had some english information.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 2, 2024

The thing to do with bits of Roman text interspersed with CJK text is, to apply CJK localisation of punctuation only if the punctuation has CJK text either side. That will address the concern about commas misapplied between Western authors; but you are saying that even in Japanese citations, there is a reluctance to use full-width punctuation. That surprises me, but that looks like a two tier punctuation localisation: no localisation of punctuation proper, only of brackets, when they contain CJK text.

@ronaldtse
Copy link
Author

@opoudjis I think in a CJK context, only localized punctuation should be used. There is no logic to use non-localized punctuation. I will make sure this happens in the next 690. The current specs that use half width punctuation and full width punctuation inconsistently is due to 690 itself not offering logical advice.

@ReesePlews
Copy link

i was concerned you might have been thinking that the following double-byte characters ( ,and .) were to be substituted for single byte (, and .) when an english entry would be shown in japanese. we would not want that type of punctuation substitution, in my understanding.

i have inquired with the client, awaiting a reply.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 2, 2024

So, a CJK context does need to be defined as a preceding character (for most characters) or a following character (for opening brackets) being CJK. If the character is Roman, leave it alone.

That has not been implemented yet, the implementation currently globally localises punctuation for a language, no matter what specific characters precede it. That, @ronaldtse, is a refinement of the notion of CJK context, so it still complies with what you want. And that, @ReesePlews, fixes the issue you're worried about.

I'm awaiting feedback on this one. Brackets are clearly going to be localised, I'm waiting to hear what the client expects to happen with punctuation.

@ronaldtse
Copy link
Author

ronaldtse commented Sep 2, 2024

@opoudjis I'm not convinced we need to have a special "CJK context" because it means there will be a "lang context" (C/J/K punctuation conventions, even for brackets, are different).

I do not quite see a need for this, and need to be convinced.

I think a Japanese bibliography entry is just that, a Japanese language bibliography entry that uses Japanese conventions. If there are English etc inside it, we will still apply the Japanese style rules.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 2, 2024

In other words, you think

Johnson、 A。、 Peters、 B。 1976。 The origins of sound 【series】。 London〯Blackwells。

is absolutely correct rendering in a Japanese or Chinese bibliography.

We really don't inhabit the same universe any more, as I've repeatedly have been finding recently, but FWIW, that's what is currently going to happen if I tell relaton-render to follow CJK punctuation for Japanese.

I await the response from MLIT with interest.

@ReesePlews
Copy link

Johnson、 A。、 Peters、 B。 1976。 The origins of sound 【series】。 London〯Blackwells。

ouch! that will not work. i have never seen any entry like that; and just to clarify (because i had problems with rendering in an earlier reply) this is what i am seeing (as an image) of this line above.

image

lets wait and see what the client comes back with.

@ReesePlews
Copy link

additionally, in my understanding, these 【 】 are not a replacement for simple double-byte [ ] . i dont see these 【 】 used that often in technical writing. have you found a source that says they are commonly used?

@ReesePlews
Copy link

hello @opoudjis i have discussed the bibliographic style rules with the client this morning. do you have a simple text list of the english terms you want to have translated into japanese. if you can show me the file, or list, i will work on that with the client. from memory they did not have a reference they could tell me where these rules were defined, so i think they will check some sources they are familiar with and come up with rules we can use. diving deeper into that may require consulting with local ISO TC 46 experts or the NDL. thank you.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 3, 2024

I'll get you the list, but I'm more interested in the formatting: we've addressed most of the translations already.

What I'm going to do is generate some sample references in Japanese, illustrating how the templates work (multiple authors, multiple bibliographic styles), and I'll request proofreading comments on them. Putting something in front of them is how we're going to get comments. I do see some reasonable-looking citations in Japanese in http://fbennett.github.io/sist02/ , thank you for that.

And I'm going to put fullwidth punctuation in Roman text in front of them, because that's something I want cauterised quickly—this:

I think a Japanese bibliography entry is just that, a Japanese language bibliography entry that uses Japanese conventions. If there are English etc inside it, we will still apply the Japanese style rules.

leads to this:

image

And that's already happening now in Chinese.

I'm busy with another task, but I'll try to get something together tonight.

@ronaldtse
Copy link
Author

Please hold off this task for the moment, I found some really good bibliographies in Japanese that will shed light on this topic, and also possibly 690.

@ReesePlews
Copy link

thanks @opoudjis and @ronaldtse. sounds like a good plan. i received a document from the client but i think it is a bit too simplified for our requirements. i look forward to the discussion at your convenience.

i will say though on that image (re-pasted here), it looks really strange to me. but we can discuss later.
363831356-d5ccaf39-a689-40c6-a16e-ef3dfd53b563

thanks!

@opoudjis
Copy link
Contributor

opoudjis commented Sep 6, 2024

On hold until Ronald produces his samples, but SIST02 are already adequate to the requirement.

@ReesePlews
Copy link

hi @opoudjis thanks for checking the SIST02 document.

@ronaldtse
Copy link
Author

I have checked a number of bibliographic compilations in the Japanese National Diet Library for usage of Japanese bibliographic styles.

I found two bibliographic collections in the NDL that are illustrate a definitive, traditional, bibliographic style -- one that is based on traditional East Asian bibliographic texts, not affected by European style bibliographies while accommodating European citations.

I suggest we adopt this style for Japanese citations.

Sample entry in 東京府文献叢書 (Tokyo Government Bibliographic Collection)

Short description Long description
Screenshot 2024-09-23 at 6 09 44 PM Screenshot 2024-09-23 at 6 13 19 PM

日本經濟史文獻

Screenshot 2024-09-15 at 9 38 34 PM Screenshot 2024-09-15 at 9 44 55 PM Screenshot 2024-09-15 at 9 46 31 PM Screenshot 2024-09-15 at 9 49 31 PM Screenshot 2024-09-15 at 9 52 53 PM Screenshot 2024-09-15 at 9 57 11 PM Screenshot 2024-09-15 at 10 02 16 PM Screenshot 2024-09-15 at 10 04 03 PM Screenshot 2024-09-15 at 10 05 46 PM Screenshot 2024-09-15 at 10 06 28 PM

中華貨幣史

Screenshot 2024-09-16 at 7 24 06 PM Screenshot 2024-09-16 at 7 25 51 PM Screenshot 2024-09-16 at 7 27 33 PM Screenshot 2024-09-16 at 7 30 18 PM Screenshot 2024-09-19 at 12 58 02 PM Screenshot 2024-09-19 at 12 59 06 PM Screenshot 2024-09-19 at 12 55 49 PM Screenshot 2024-09-19 at 1 00 48 PM Screenshot 2024-09-19 at 3 31 05 PM Screenshot 2024-09-16 at 8 46 26 PM Screenshot 2024-09-19 at 3 31 54 PM

@ReesePlews
Copy link

very interesting bit of research. thanks for compiling this @ronaldtse, a lot of work to locate and annotate those.

i notice the age of the sources are quite old, pre-digital period. that would be a concern to me, unless this research is mostly about identifying parts of the bibliographic entry/record for styles/rendering, so that they could be added to your tools.

did you reach out to the NDL or just find these through a search? i wonder if they could recommend a newer document, something post 90s so we can see styling of entries for digital artifacts.

i know there is an interest in vertical layout. i have never encountered a client/end user requirement for vertical layout, but i know that some documents only use vertical layout. having the flexibility for vertical layout would definitely be interesting and useful for some users here. but it is not a priority in any of my current projects.

i would also be concerned with the punctuation in our bibliographic entries. i would make a guess that sometimes the bibliographic entries will be formatted with non-japanese punctuation, similar to western bibliographic entries. there could even be a mix, with japanese entries having japanese punctuation and western entries using western punctuation. i can understand that in older sources, the entries would have a definite japanese style because most readers of the time my not be accustomed to seeing western styles. that would not be the case today.

i look forward to continuing the discussion. if there are changes implemented let me know and i will check them with the client.

@ronaldtse
Copy link
Author

ronaldtse commented Sep 23, 2024

Thanks for sharing the thoughts. It is interesting that actually the bibliographic format has not really changed in JIS (for non-standards, of course).

This is JIS X 8301-2019:
Screenshot 2024-09-23 at 8 41 19 PM

Notice that there is minimal usage of punctuation, because in East Asian text the spacing is somewhat semantically significant.

With regards to these questions:

  • How do we handle an English/Foreign language bibliographic entry in a Japanese bibliography?
    • It seems that we need to use the language native format (English entry in English)
    • We could also treat it as a Japanese entry from English (English entry in Japanese)
  • Punctuation in a Japanese bibliography
    • We need to figure out. In 8301, the punctuation is used for "Online" or additional annotations. We will need to map those to the Japanese format.

@opoudjis
Copy link
Contributor

While we can vary the actual punctuation depending on language, I'd really rather we didn't.

I insist that the choice of full-width vs half-width punctuation should be dependent on linguistic context, not on the overall document language. (So Roman punctuation near Roman text, Japanese punctuation near Japanese text.) The alternative is indefensible.

The templates and translations to be updated are:

https://github.com/metanorma/metanorma-jis/blob/main/lib/relaton/render-jis/config.yml
https://github.com/metanorma/metanorma-jis/blob/main/lib/isodoc/jis/i18n-ja.yaml
https://github.com/relaton/relaton-render/blob/main/lib/isodoc-yaml/i18n-ja.yaml
https://github.com/metanorma/isodoc/blob/main/lib/isodoc-yaml/i18n-ja.yaml

What you have supplied me... is not enough for that. For starters, do I treat the precedence as JIS X 8301-2019 > STIS > Random National Library scans?

The i18n will also need updating for position of phrases, it presupposes Indo-European syntax currently a lot.

This is going to have to be iterative, and I will need target references to work with.

@ronaldtse
Copy link
Author

So Roman punctuation near Roman text, Japanese punctuation near Japanese text

Do you really mean this?

  • Japanese bibliographic style for Japanese items; English bibliographic style for English items

For a bibliography that cites sources from multiple languages, is it reasonable to have separate sections (or grouped items) for each language?

As for using an English bibliographic style for Japanese:

  • I am opposed to this.
  • Some languages don't use punctuation. We can't "make up" punctuation usage for them. e.g. East Asian text have no italics : CJK fonts don't supply them (yes, word processors do skew text to achieve that, but there's no such font style.) We should not force that.

SIST is actually an adoption of ISO 690's style, including punctuation usage. It is the primary citation of SIST 02, see below. We know 690 doesn't work well with CJK:
Screenshot 2024-09-24 at 6 46 40 PM

I'll come up with something that we possibly should bring back to a new, internationalized, 690.

@opoudjis
Copy link
Contributor

opoudjis commented Sep 24, 2024

I'm continuing this offline. I am putting this ticket back on hold.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants