[gtld-tech] Specification 5 - Country names... again..

Colosi, John jcolosi at verisign.com
Mon Jan 13 16:26:28 UTC 2014


Good points Gavin.  Makes sense to me.

Using your strategy then, we should only check in U-labels to the repo.  But it might be nice to have a tool that could capture all of the files in the repo, and "compile" them into a single file of unique A-labels.


John Colosi
Senior Manager of Product Development
JColosi at Verisign.com

m: 703-967-4062 t: 703-948-3211
12061 Bluemont Way, Reston VA 20190

VerisignInc.com 



-----Original Message-----
From: Gavin Brown [mailto:gavin.brown at centralnic.com] 
Sent: Monday, January 13, 2014 9:48 AM
To: Colosi, John; gtld-tech at icann.org
Cc: Gould, James; Anderson, Marc
Subject: Re: [gtld-tech] Specification 5 - Country names... again..


On 10/01/2014 15:52, Colosi, John wrote:
> Hi Gavin, it looks like most of the files in the repo are using the utf8 format.  But S5.4.3.txt seems to be in utf16.  (It starts with a bunch of surrogate pairs.)  I wonder if we can standardize on a single format.

I'll see what I can do about converting that file to UTF-8. iconv complains for me when I try to convert that file from utf-16 to utf-8, so I wonder if there has been some mixing of encodings when the file was assembled.

> I might even suggest using A Labels as copying and pasting and comparing is less error prone (for me).  Maybe I just don't have the right tools.  If we decide to standardize then I can help with conversions, but wanted to get some input from folks.

Using A-labels would be less error-prone, but also harder for people who speak the relevant languages. Under ideal circumstances, we'd have language experts reviewing the strings, and it would be a real pain for them to have to keep converting A-labels to U-labels and back again. The U-labels are the source code: the stuff that human beings work with.

G.

--
Gavin Brown
Chief Technology Officer
CentralNic Group plc (LSE:CNIC)
Innovative, Reliable and Flexible Registry Services for ccTLD, gTLD and private domain name registries https://www.centralnic.com/

CentralNic Group plc is a company registered in England and Wales with company number 8576358. Registered Offices: 35-39 Moorgate, London, EC2R 6AR.



More information about the gtld-tech mailing list