[Neobrahmigp] [Ext] Re: Malayalam LGR document v1.2

Sarmad Hussain sarmad.hussain at icann.org
Mon May 14 08:53:02 UTC 2018


Thank you Veena, for raising these points.



Kindly note that separate discussions around ZWJ and ZWNJ would need to be 
included in the Malayalam proposal, based on the solution finalized, 
highlighting limitations of the solution.



Regards,

Sarmad



From: veena solomon [mailto:veena.ycet at gmail.com]
Sent: Sunday, May 13, 2018 3:48 PM
To: Sarmad Hussain <sarmad.hussain at icann.org>
Cc: neo brahmi <neobrahmigp at icann.org>
Subject: [Ext] Re: Malayalam LGR document v1.2



Thank you! I have gone through the invalids and they are indeed caused by the 
joiner characters and the split matras being used.



Historically, ZWJ was used to render chillu in certain fonts but later Unicode 
included chillu characters as standalone codepoints and MSR-3 also includes 
these standalone chillu characters. So, I think ZWJ need not be added in IDN.



ZWNJ, is used to prevent the formation of conjunct ligatures and it is 
required to avoid spelling mistakes and unnecessary conjuncts. For example, in 
a 2 word label, the first word ending in virama can form conjunct with the 
second word starting in a consonant. This causes a spelling mistake.



I request the other Malayalam panel members to go through the same and make 
suggestions.



Regards,







Veena Solomon



 <https://urldefense.proofpoint.com/v2/url?u=http-3A__www.twitter.com_vinazol&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=VvEhcye6dbvfrcJn7QJ_CWam9TAce41RBtXjct8dYxY&e=>[twitter.com]<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.pinterest.com_vinazol&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=eBDbmQc5fOWmc2pbL8S6DH1_I_f41AXUP1Su9sitQug&e=>  [pinterest.com]<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.facebook.com_vinazol&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=aEUpbpLSynk606DUMkXhsOrHbEifZ2lvcIgcKnVDeVw&e=>  [facebook.com]<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.quora.com_Veena-2DSolomon&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7
 xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=5rpQpYxzeTGeC_jFSliF4BjCij1v3-RTAFieaiJ-jL0&e=>  [quora.com]<https://urldefense.proofpoint.com/v2/url?u=http-3A__foursquare.com_user_7402337&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=JDVgjRlKJS2x6YG-4LFP05cKAkDIqlG5OLrjyYoQOuM&e=>  [foursquare.com]<https://urldefense.proofpoint.com/v2/url?u=https-3A__plus.google.com_u_0_105213014676403488949_&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=zqHxFS0PXwHOP4dehAuurlfCKTDqxqtm-CgE-Lj7XW8&e=>  [plus.google.com]On Sun, May 13, 2018 at 12:08 AM, Sarmad Hussain <sarmad.hussain at icann.org<mailto:sarmad.hussain at icann.org> > wrote: Dear All,Please find attached the Malayalam XML/HTML based on the proposal version:https://docs.google.com/doc
 ument/d/1KTmiGSuxsyrEdzkAVqA8coIqc4_eNAYygOUEl0ZcA7U/edit#heading=h.o9uxhnnsmlal [docs.google.com]<https://urldefense.proofpoint.com/v2/url?u=https-3A__docs.google.com_document_d_1KTmiGSuxsyrEdzkAVqA8coIqc4-5FeNAYygOUEl0ZcA7U_edit-23heading-3Dh.o9uxhnnsmlal&d=DwMFaQ&c=FmY1u3PJp6wrcrwll3mSVzgfkbPSS6sJms7xcl4I5cM&r=KTETvEaGPwPcawI-QmNa-kiv-ZBvdgyyLm-mxd028M4&m=cPysmF2IHbHVxBNijSLpCzJqi_TxL8X332umTfcxXOw&s=qrVv5v1D-VVXjxbbs_Ym5apegpglPmru_v5aeRJuU7E&e=> .The test results are also attached for your review, based on a Malayalamcorpus available online.  There are 17k labels which are rejected out of 130klabels.  Kindly review and see if the XML/HTML are as per the proposal.  Therejections may be because the Chillu characters are represented by theirearlier forms using the joiner characters.Kindly let us know if you have any feedback.We will share these with the IP.Regards,Sarmad
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mm.icann.org/pipermail/neobrahmigp/attachments/20180514/baa435da/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ~WRD000.jpg
Type: image/jpeg
Size: 823 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/neobrahmigp/attachments/20180514/baa435da/WRD000-0001.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 3755 bytes
Desc: not available
URL: <http://mm.icann.org/pipermail/neobrahmigp/attachments/20180514/baa435da/smime-0001.p7s>


More information about the Neobrahmigp mailing list