[Latingp] character-based analysis

Ahmed Bakhat ahmedbakhat at yahoo.com
Sat Jun 3 20:20:19 UTC 2017


Dear Mirjana and all group members of Repertoire sub group,
I think first we have to focus on available characters under available Unicode charts for Latin Script, then we have to devise principles / rules for inclusion / exclusion / deffer, on the basis of usage in different languages. After having a table, we have to look for the usage in language.

I am attaching first draft of principles for Latin Script, available Unicode charts and MSR-2 documents, for start of the discussion of the group, thous 1st chart (0000 to 007F)  does not need any discussion as it is already in use as ASCII code.

Best Regards,
Ahmed Bakht  


On Thursday, May 25, 2017, 7:49:47 PM GMT+5, Mirjana Tasić <Mirjana.Tasic at rnids.rs> wrote:


Dear Nebiye,
 
  
 
I am trying to understand the idea behind your proposal. What is the purpose of looking for specific characters through all languages.  Are you trying to develop the Repertoire of all characters used in languages with Latin script for future processing?
 
  
 
Regards Mirjana
 
  
 
From: <latingp-bounces at icann.org> on behalf of Mats Dufberg <mats.dufberg at iis.se>
Date: Thursday, May 25, 2017 at 12:17
To: Textual Solutions <textualsolutions at gmail.com>, Latin GP <latingp at icann.org>
Subject: Re: [Latingp] character-based analysis
 
  
 
1.     If not found we still do not know if it should be included or not.
 
2.     We have to return to all languages for characters that we have not found elsewhere.
 
3.     We have to investigate all characters in every language anyway to make to see if it has any combination of base character and combining mark.
 
 
 
For every character (or combination) that we want to include we should find evidence that it is used according to the principles. To have a firm ground we should not just register for one language, but for several, in case some language is excluded at a later stage or that evidence is found to be invalid.
 
 
 
 
 
Mats
 
 
 
---
 
Mats Dufberg
 
DNS Specialist, IIS
 
Mobile: +46 73 065 3899
 
https://www.iis.se/en/
 
 
 
 
 
From: <latingp-bounces at icann.org> on behalf of Textual Solutions <textualsolutions at gmail.com>
Date: Thursday 25 May 2017 at 09:21
To: Latin GP <latingp at icann.org>
Subject: [Latingp] character-based analysis
 
 
 
Dear All, 
 
Each member of the Rep. group may be invited to look at one character only across the languages listed. What do you think? Pls see sample attached and comment. Thanks. 
 
NPK
 _______________________________________________
Latingp mailing list
Latingp at icann.org
https://mm.icann.org/mailman/listinfo/latingp
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: code Point Principles for Latin   Script GP Ver 0.1.docx
Type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
Size: 27915 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/codePointPrinciplesforLatinScriptGPVer0.1-0001.docx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: MSR-2-non-cjk-13apr15-en.pdf
Type: application/pdf
Size: 2528048 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/MSR-2-non-cjk-13apr15-en-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Basic Table 0000 -   007F.pdf
Type: application/pdf
Size: 446420 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeBasicTable0000-007F-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended A Table 0100 -   017F.pdf
Type: application/pdf
Size: 201818 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedATable0100-017F-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended Additional Table   1E00 - 1EFF.pdf
Type: application/pdf
Size: 277141 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedAdditionalTable1E00-1EFF-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended B Table 0180 -   024F.pdf
Type: application/pdf
Size: 417124 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedBTable0180-024F-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended C Table 2C60 -   2C7F.pdf
Type: application/pdf
Size: 216997 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedCTable2C60-2C7F-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended D Table A720 -   A7FF.pdf
Type: application/pdf
Size: 328223 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedDTableA720-A7FF-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Extended E Table AB30 -   AB6F.pdf
Type: application/pdf
Size: 236453 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeExtendedETableAB30-AB6F-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Unicode Supplement Table 0080 -   00FF.pdf
Type: application/pdf
Size: 466082 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/latingp/attachments/20170603/2c52a8dc/UnicodeSupplementTable0080-00FF-0001.pdf>


More information about the Latingp mailing list