[UA-discuss] Assamese
Asmus Freytag
asmusf at ix.netcom.com
Wed May 16 22:02:53 UTC 2018
All,
it might be useful if the UA community where more aware of the
continuing efforts to request a disunification of Assamese from Bengali
on a script basis. The forwarded message contains a link to the proposal
so you can read along. (The proposal calls for the "inclusion of the
Assamese script" in Unicode/ISO 10646 which sidesteps the fact that the
script currently encoded in Unicode, despite the name "Bengali" is in
fact covering both the Bengali and Assamese languages).
As noted by the person commenting on it on the Unicode list, the issues
cited are all common for cases where more than one language shares the
same script.
If you look in the proposal document, you will see that the list of
characters are not really distinct from each other; so it's not the case
that the languages use different forms of the same basic letters (unlike
the European languages when historically German would use other letter
shapes than French).
The consequences of splitting the script for IDNs would be pretty
drastic, as every single label would have to be a blocked variant of
some label in the "other" script; worse would be the issue that users
seeing a label in print would (in many cases) not be able to tell which
script to use to enter it.
Even for regular text, the situation would be chaotic, as you could type
either language in either script and it would largely "look" OK, but
sort differently. Add to that, the issue that decades of existing data
will continue to exist in what would then be the "wrong" script.
What is of concern here is not that there is a high likelihood of
Unicode accepting a proposal like that, but the level of activity in the
community being geared up in support of it.
Let's hope it doesn't ever get there,
A./
-------- Forwarded Message --------
Subject: L2/18-181
Date: Wed, 16 May 2018 13:46:22 -0700
From: Doug Ewell via Unicode <unicode at unicode.org>
Reply-To: Doug Ewell <doug at ewellic.org>
To: Unicode Mailing List <unicode at unicode.org>
http://www.unicode.org/L2/L2018/18181-n4947-assamese.pdf
This is a fascinating proposal to disunify the Assamese script from
Bengali on the following bases:
1. The identity of Assamese as a script distinct from Bengali is in
jeopardy.
2. Collation is different between the Assamese and Bengali languages,
and code point order should reflect collation order.
3. Keyboard design is more difficult because consonants like ক্ষ
are encoded as conjunct forms instead of atomic characters.
4. The use of a single encoded script to write two languages forces
users to use language identifiers to identify the language.
5. Transliteration of Assamese into a different script is problematic
because letters have different phonological value in Assamese and
Bengali.
It will be interesting to see where this proposal goes. Given that all
or most of these issues can be claimed for English, French, German,
Spanish, and hundreds of other languages written in the Latin script, if
the Assamese proposal is approved we can expect similar disunification
of the Latin script into language-specific alphabets in the future.
--
Doug Ewell | Thornton, CO, US | ewellic.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mm.icann.org/pipermail/ua-discuss/attachments/20180516/e46ab7d6/attachment.html>
More information about the UA-discuss
mailing list