[gtld-tech] IDN Tables

John Hollifield John.Hollifield at nominet.org.uk
Tue Oct 22 11:49:51 UTC 2013


Hi

I am a bit confused about how our IDN table should be structured with respect to variants as I feel section 5, A Model Table Format, of RFC 4290 (https://www.ietf.org/rfc/rfc4290.txt) is a bit ambiguous. I am therefore hoping there is somebody out there with experience of these things that could help me. I have looked at other IDN tables but none seem to help me with the issue below. The RFC states:

Each non-comment line in the table starts with the character that is allowed in the registry and expected to be used in registrations, which is also called the "base character".

It then goes on to say:

If the base character has any variants, the base character is followed by a vertical bar character ("|", ASCII 0x7C) and the variant string.  If the base character has more than one variant, the variants are separated by a colon (":", ASCII 0x3A).

So if I have the following characters which are equivalent within the registry

e (U+0065) = è (U+00E8) = é (U+00E9) = ê (U+00EA) = ë (U+00EB)

Would my table need to be

U+0065|U+00E8:U+00E9:U+00EA:U+00EB # LATIN SMALL LETTER E (e)

which implies that everything after the | is an allowed character or do I have to explicitly take the first statement above into account where I need to put each character with its variants in a separate line therefore listing out all combinations i.e.

U+0065|U+00E8:U+00E9:U+00EA:U+00EB # LATIN SMALL LETTER E (e)
U+00E8|U+0065:U+00E9:U+00EA:U+00EB # LATIN SMALL LETTER E WITH GRAVE
U+00E9|U+0065:U+00E8:U+00EA:U+00EB # LATIN SMALL LETTER E WITH ACUTE
U+00EA|U+0065:U+00E8:U+00E9:U+00EB # LATIN SMALL LETTER E WITH CIRCUMFLEX
U+00EB|U+0065:U+00E8:U+00E9:U+00EA # LATIN SMALL LETTER E WITH DIAERESIS

If the second method is the correct way does the order then become important?

Any help would be appreciated.

Regards

John

John Hollifield
GTLD Systems and Business Data Lead
Nominet

Tel: 01865 332333
Mob: 07979 696734
Email: John.Hollifield at nominet.org.uk<mailto:John.Hollifield at nominet.org.uk>

John Hollifield
GTLD Systems and Business Data Lead
Nominet

Tel: 01865 332333
Mob: 07979 696734
Email: John.Hollifield at nominet.org.uk<mailto:John.Hollifield at nominet.org.uk>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mm.icann.org/pipermail/gtld-tech/attachments/20131022/ff319112/attachment.html>


More information about the gtld-tech mailing list