[tz] Stable fixed length identifiers for IANA time zones

Tobias Conradi tobias.conradi at gmail.com
Sat May 26 23:24:31 UTC 2012


Below is a complete mapping of the identifiers in timezone.xml onto a
set of 5-character identifiers. The newly created codes are distinct
from UN/LOCODEs because they use the digits 0 and 1, which UN/LOCODE
does not use, per

http://www.unece.org/fileadmin/DAM/cefact/locode/unlocode_manual.pdf
section 3.2.1: "However, where all permutations available for a country
have been exhausted, the numerals 2-9 may also be used."

On Sat, May 26, 2012 at 9:59 PM, Tobias Conradi
<tobias.conradi at gmail.com> wrote:
> Steven, Mark,
>
> I checked the latest timezone.xml contained in core.zip linked from
> http://cldr.unicode.org/index/bcp47-extension
...
> Since UN/LOCODE doesn't use the numbers 0 and 1, I created private
> codes using "1" in third position, so for Santa Isabel I would use
>
> MX1SI or in lower case mx1si
>
> for Hebron PS1HB, Gaza PS1GZ
>
> That way the codes all can be of the same length, namely 5 characters.

The UTC-based codes could be converted to 5 characters too, by
replacing utc with zz:
utce01 -> zze01
utcw12 -> zzw12

UTC itself could be:
utc -> zz000

Unknown could be:
unk -> zzunk or zz1un

The use of 0 and 1 ensures there is no clash with UN/LOCODEs.
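
The utc-to-zz rewriting above is mechanical, so it can be sketched in a
few lines of Python. This is only an illustration: the function name
utc_to_zz is made up here, and the sketch assumes every UTC-based
identifier has the shape utc, or utc followed by e or w and a two-digit
hour count, as in the examples above.

```python
import re


def utc_to_zz(code: str) -> str:
    """Map a UTC-based identifier to a 5-character zz code (hypothetical helper)."""
    if code == "utc":
        # UTC itself gets the all-zero code proposed above.
        return "zz000"
    # Assumed shape: "utc" + direction letter (e/w) + two-digit hours.
    match = re.fullmatch(r"utc([ew])(\d{2})", code)
    if match is None:
        raise ValueError(f"not a UTC-based identifier: {code!r}")
    direction, hours = match.groups()
    return f"zz{direction}{hours}"


for code in ("utce01", "utcw12", "utc"):
    print(code, "->", utc_to_zz(code))
```

The unk case is left out on purpose, since two candidates (zzunk and
zz1un) are given for it above.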

Here are some more possible mappings for identifiers that are not
5 characters long:
usndnsl -> usnqy (UN/LOCODE USNQY)
usndcnt -> uszt8 (UN/LOCODE USZT8)

Handmade codes, using "1" in third position and the ISO 3166-1 alpha-2
code that is correct as of assignment:
gaza -> ps1gz
gldkshvn -> gl1dm
hebron -> ps1hb
jeruslm -> il1jr
mxstis -> mx1si
usnavajo -> us1nv
usinvev -> us1vv
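
The no-clash argument for the handmade codes above can be checked
mechanically. The following Python sketch assumes, based on the
UN/LOCODE manual section quoted earlier, that the three-character
location part of a UN/LOCODE only ever uses letters and the numerals
2-9. The names HANDMADE and is_private_code are invented for this
example.

```python
# Handmade codes from the list above.
HANDMADE = {
    "gaza": "ps1gz",
    "gldkshvn": "gl1dm",
    "hebron": "ps1hb",
    "jeruslm": "il1jr",
    "mxstis": "mx1si",
    "usnavajo": "us1nv",
    "usinvev": "us1vv",
}


def is_private_code(code: str) -> bool:
    """True if code is 5 ASCII alphanumerics, starts with two letters,
    and has a 0 or 1 in the location part, so that (under the assumption
    stated above) it cannot be a UN/LOCODE."""
    if len(code) != 5 or not (code.isascii() and code.isalnum()):
        return False
    country, location = code[:2], code[2:]
    return country.isalpha() and any(ch in "01" for ch in location)


for old, new in HANDMADE.items():
    print(old, "->", new, is_private_code(new))
```

Real UN/LOCODEs such as usnqy and uszt8 fail this check, as they
should, because they contain neither 0 nor 1.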

That would leave only four US-specific codes:
cst6cdt
est5edt
mst7mdt
pst8pdt

If they could be changed, they could become:
us1c6
us1e5
us1m7
us1p8
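
The four codes above follow one pattern: us1, then the first letter of
the zone abbreviation, then the hour offset digit. A small Python
sketch of that derivation follows. The function name us_zone_to_code is
made up, and the sketch assumes the 7-character shape shared by the
four identifiers above.

```python
def us_zone_to_code(zone: str) -> str:
    """Derive a code such as us1c6 from an identifier such as cst6cdt
    (hypothetical helper)."""
    # Assumed shape: three letters, one offset digit, three letters.
    if len(zone) != 7 or not zone[3].isdigit():
        raise ValueError(f"unexpected identifier shape: {zone!r}")
    return f"us1{zone[0]}{zone[3]}"


for zone in ("cst6cdt", "est5edt", "mst7mdt", "pst8pdt"):
    print(zone, "->", us_zone_to_code(zone))
```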

-- 
Tobias Conradi
Rheinsberger Str. 18
10115 Berlin
Germany

http://tobiasconradi.com/


