[Latingp] Latin GP findings - Cross-script variants with Cyrillic script.

Mirjana Tasić Mirjana.Tasic at rnids.rs
Mon Nov 5 17:01:19 UTC 2018


Dear Cyrillic GP,

Latin GP has been working on cross-script variants with Cyrillic script and would like to share two findings with you:


  1.  Latin GP has identified one variant set that would lead to an in-script variant in the Cyrillic script LGR.
  2.  Latin GP has found 6 cross-script variants with Cyrillic script that Cyrillic GP did not include.

Cyrillic in-script variant finding
During a thorough investigation of variants with Cyrillic script, Latin GP came to the conclusion that one Latin character, U+0079, should be a variant of two different Cyrillic characters, U+04AF and U+0443. Cyrillic GP has already classified U+0079 and U+0443 as variants. Latin GP considers U+04AF to be sufficiently similar to U+0079 to be classified as its variant as well.

Because variant relations are required to be transitive, and U+0079 would be a variant of both Cyrillic characters, this finding leads to an in-script variant in Cyrillic script, i.e. between U+04AF and U+0443.
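
The transitivity argument can be illustrated with a small sketch. It assumes variant relations are modelled as unordered pairs of code points and groups them with a union-find structure. The function name `variant_sets` and the data layout are hypothetical and are not taken from any LGR tooling.

```python
from collections import defaultdict


def variant_sets(pairs):
    """Group code points into sets closed under symmetry and transitivity.

    `pairs` is an iterable of (code_point, code_point) variant relations.
    This is an illustrative helper, not part of any LGR toolset.
    """
    parent = {}

    def find(cp):
        # Register unseen code points as their own root.
        parent.setdefault(cp, cp)
        while parent[cp] != cp:
            parent[cp] = parent[parent[cp]]  # path halving
            cp = parent[cp]
        return cp

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = defaultdict(set)
    for cp in list(parent):
        groups[find(cp)].add(cp)
    return [members for members in groups.values() if len(members) > 1]


# The two cross-script relations discussed above.
PAIRS = [
    (0x0079, 0x0443),  # Latin y ~ Cyrillic u (already classified by Cyrillic GP)
    (0x0079, 0x04AF),  # Latin y ~ Cyrillic straight u (Latin GP finding)
]

SETS = variant_sets(PAIRS)
for members in SETS:
    print(sorted(f"U+{cp:04X}" for cp in members))
# prints ['U+0079', 'U+0443', 'U+04AF']
```

Both Cyrillic code points end up in the same set as U+0079, which is why the two cross-script relations imply an in-script relation between U+04AF and U+0443.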

The code points in this variant set are:

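Source Unicode Name  | Source Code Point | Source Glyph | Target Glyph | Target Code Point | Target Unicode Name              | Rationale
---------------------+-------------------+--------------+--------------+-------------------+----------------------------------+------------------------------------
LATIN SMALL LETTER Y | 0079              | y            | ү            | 04AF              | CYRILLIC SMALL LETTER STRAIGHT U | Glyphs identical due to font design
LATIN SMALL LETTER Y | 0079              | y            | у            | 0443              | CYRILLIC SMALL LETTER U          | Homoglyph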



A possible next step is a discussion with the IP.


Latin-Cyrillic cross-script analysis findings

Latin GP has identified the following cross-script variant sets between Latin and Cyrillic script, which Cyrillic GP has not included:



No. | Source Unicode Name                 | Source Code Point | Source Glyph | Target Glyph | Target Code Point | Target Unicode Name                   | Rationale
----+-------------------------------------+-------------------+--------------+--------------+-------------------+---------------------------------------+-------------------------------------------
1.  | LATIN SMALL LETTER R                | 0072              | r            | г            | 0433              | CYRILLIC SMALL LETTER GHE             | Glyphs nearly identical due to font design
2.  | LATIN SMALL LETTER Y                | 0079              | y            | ү            | 04AF              | CYRILLIC SMALL LETTER STRAIGHT U      | Glyphs nearly identical due to font design
3.  | LATIN SMALL LETTER R WITH ACUTE     | 0155              | ŕ            | ѓ            | 0453              | CYRILLIC SMALL LETTER GJE             | Glyphs nearly identical due to font design
4.  | LATIN SMALL LETTER R WITH STROKE    | 024D              | ɍ            | ғ            | 0493              | CYRILLIC SMALL LETTER GHE WITH STROKE | Glyphs nearly identical due to font design
5.  | LATIN SMALL LETTER U WITH DOT BELOW | 1EE5              | ụ            | џ            | 045F              | CYRILLIC SMALL LETTER DZHE            | Glyphs nearly identical due to font design
6.  | LATIN SMALL LETTER Y WITH TILDE     | 1EF9              | ỹ            | ӯ            | 04EF              | CYRILLIC SMALL LETTER U WITH MACRON   | Glyphs nearly identical due to font design
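
When a table like this is copied by hand, a code point and its Unicode name can drift apart. As a minimal sketch, the listed names can be checked against the Unicode Character Database using Python's standard `unicodedata` module. The `ROWS` layout and the helper name `name_mismatches` are assumptions made for illustration; the result depends on the Unicode version bundled with the Python interpreter.

```python
import unicodedata

# (source code point, source name, target code point, target name),
# transcribed from the six proposed cross-script variants.
ROWS = [
    (0x0072, "LATIN SMALL LETTER R", 0x0433, "CYRILLIC SMALL LETTER GHE"),
    (0x0079, "LATIN SMALL LETTER Y", 0x04AF, "CYRILLIC SMALL LETTER STRAIGHT U"),
    (0x0155, "LATIN SMALL LETTER R WITH ACUTE", 0x0453, "CYRILLIC SMALL LETTER GJE"),
    (0x024D, "LATIN SMALL LETTER R WITH STROKE", 0x0493,
     "CYRILLIC SMALL LETTER GHE WITH STROKE"),
    (0x1EE5, "LATIN SMALL LETTER U WITH DOT BELOW", 0x045F,
     "CYRILLIC SMALL LETTER DZHE"),
    (0x1EF9, "LATIN SMALL LETTER Y WITH TILDE", 0x04EF,
     "CYRILLIC SMALL LETTER U WITH MACRON"),
]


def name_mismatches(rows):
    """Return (code point, listed name, database name) for every disagreement."""
    mismatches = []
    for src_cp, src_name, tgt_cp, tgt_name in rows:
        for cp, listed in ((src_cp, src_name), (tgt_cp, tgt_name)):
            # The default value covers code points unknown to this Python build.
            actual = unicodedata.name(chr(cp), "<unassigned>")
            if actual != listed:
                mismatches.append((f"U+{cp:04X}", listed, actual))
    return mismatches


MISMATCHES = name_mismatches(ROWS)
for code_point, listed, actual in MISMATCHES:
    print(f"{code_point}: listed as {listed!r}, database says {actual!r}")
```

An empty `MISMATCHES` list means every listed name agrees with the database for its code point. The check says nothing about visual similarity, which still has to be judged by the panels.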




Best regards
Mirjana Tasic
Latin GP chair
