[gtld-tech] Specification 5 - Country names... again..
alexander.mayrhofer at nic.at
Wed Dec 4 14:11:12 UTC 2013
I have a few more comments on the lists that we have been collecting on the Github site. Specifically:
> 2. I've attached Verisign's spreadsheet for section 5.4.3 (UN Member States
> and translations). There are 7 languages (including both traditional and
> simplified Chinese). Spaces and hyphens are tricky. The issue is that
> translation engines (here I mostly use Google's service) will often handle
> spaces intelligently. For instance, "Central African Republic" translated to
> Russian becomes "Центрально-Африканская Республика". That is, a
> hyphen is introduced for the first space. I presume this is a standard for the
> "Central" modifier. You could choose to create a variant without any
> hyphens or spaces, or you could try all the permutations of hyphens and
> spaces. Instead of heading down that rabbit hole, we elected to yield to the
> translation wherever possible. But we'll respect the group decision and
> update our implementation accordingly.
On Specification 5.4.3:
The agreement says "the list of United Nations member states in 6 official United Nations languages prepared by the Working Group on Country Names of the United Nations Conference on the Standardization of Geographical Names;"
I do understand that there is an offical document from the UN that provides translations for those country names. The URL (at least the one i've been using) is: http://unstats.un.org/unsd/geoinfo/UNGEGN/docs/26th-gegn-docs/WP/WP54_UNGEGN%20WG%20Country%20Names%20Document%202011.pdf
I do understand that the above document already contains readily translated / agreed country names for the six UN languages, as required in 5.4.2. However, i do also understand that the list that's now in the Github repository was actually produced using automated translation, which potentially could yield different results compared to extracting the list of country names from the document (or the respective source file)?
Therefore, could you please clarify whether the file that's now on the Github was produced by translation engines, or taken from said document? I think that's a significant impact on whether or not that file is usable to cover ICANN's requirement.
On Specification 5.4.2:
The agreement requires "the United Nations Group of Experts on Geographical Names, Technical Reference Manual for the Standardization of Geographical Names, Part III Names of Countries of the World".
Looking at the respective document (http://unstats.un.org/unsd/geoinfo/ungegn/docs/pubs/UNGEGN%20tech%20ref%20manual_m87_combined.pdf), the actual list (starting on page 185) contains several columns:
- ISO3166 code
- COUNTRY- English (UN)
- ISO Code; language
- Short Name
- Formal Name
It is unclear to me which of the various columns is to be considered for the reserved names list. Candidates would be the "Country-ENGLISH" (although that is likely already covered by the UN list described in 5.4.3?), the "Short Name", and the "Formal Name". However, looking at the file that is now uploaded on the github for the respective Specification (https://github.com/centralnic/spec5-reserved-names/blob/master/S5.4.2.txt), it doesn't seem to correspond to *any* of these columns.
for example, the Entry for "AF" (assumed) in the file lists "afganistán" does not exist in the PDF - in neither of the respective columns. I'm slightly worried that therefore the list that is currently uploaded does not fulfill the respective requirement (besides that i think the original requirement is highly inprecise and hence disputable)..
Do i look at the wrong files? Do people interpret the requirements in a different form? Any comments appreciated...
More information about the gtld-tech