Writing System Versions
While working on a description of the Eastern Dan writing system I found it challenging to keep track of all the different writing system evolutions and the various materials produced in each writing system. As I listen to the challenges faced in various linguistic projects (archiving, OCR, language revitalization, corpus creation) around the world, which work with written linguistic materials I find that they also often have the same basic problems. That is, the same language may be written with different transcription systems. There is currently no clear and concise way to create an identifier for language transcription systems. Some have turned to the ISO 639-3 to say “the writing system of x language”, however this is often not descriptive enough. The BCP47 identifier is much more descriptive and useful in this context.
I use the term writing system rather than orthography because orthography is narrowly construed in technological contexts. That is, orthography applies to spelling, while writing system applies to the characters used to communicate in a given language (Citation: Constable, 2002) Constable, P. (2002). Toward a Model for Language Identification Defining an ontology of language-related categories. SIL Electronic Working Papers, 2002-003. Retrieved from https://www.sil.org/resources/publications/entry/7853 . Sometimes a writing system revision implies or necessitates an orthography change, sometimes it doesn’t. When an orthography change adds or removes a character (Unicode character) from the language it also implies a writing system change.
These relationships are illustrated in the following graph replicated from (Citation: Constable, 2002) Constable, P. (2002). Toward a Model for Language Identification Defining an ontology of language-related categories. SIL Electronic Working Papers, 2002-003. Retrieved from https://www.sil.org/resources/publications/entry/7853 .
For example the following languages and their writing system histories demonstrate some of the complexities which a catalogue of writing system evolutions would need to consider.
The Tarok orthography falls in the category of an SIL revision in 1976 of a missionary orthography first produced in 1919. Tarok people did a comprehensive revision in 2018, covering alphabets and rules of the writing system. Tarok [ISO 639-3: yer] is spoken Central Nigeria.
Dan is a language of Liberia and the Ivory coast [ISO 639-3: dnj]. It requires three BCP47 codes to distinguish the language-writing system combinations, because there are three writing systems one in Liberia, one for Eastern Dan, one for Western Dan. Historically, Dan-Kla [ISO 639-3: lda] now recognized as a separate language, was also lumped in with Dan, the ISO 639-3 registrar addressed this in 2013.
##3# Eastern Dan Within the Dan example Eastern Dan has a history of 5 writing system versions, some with SIL involvement, some without. These are writing system revisions, not orthography revisions because the glyphs used in each case were different. It wasn’t just the spelling system that changed, it what the character repertoire needed to write the language.
Is a set of 4 ISO 639-3 languages but the Mexican government acknowledges something like 7 unique languages. They all share one writing system but spell words how they pronounce them in their respective regions. This means that there is one writing system, but several orthographies. As far as I know no orthography is standardized.
Four countries, various different writing systems (glyphs), countries have discrete version histories, but conceded that they have one language.
A language which uses both Ajami and Latin scripts.
With these kinds of issues in mind, I crafted the following fictitious statement which exemplifies some of the things for which a good catalogue of writing system versions would account.
- Constable (2002)
- Constable, P. (2002). Toward a Model for Language Identification Defining an ontology of language-related categories. SIL Electronic Working Papers, 2002-003. Retrieved from https://www.sil.org/resources/publications/entry/7853