[tz] TZ file comments UTF-8?

Markus Kuhn Markus.Kuhn at cl.cam.ac.uk
Thu Feb 14 14:10:42 UTC 2013


Paul Eggert wrote on 2013-01-13 07:23 UTC:
> On 01/12/2013 10:00 PM, Jaakko Hyvätti wrote:
> > sooner or later it will be utf-8 as it has been
> > for the rest of the world.
> 
> In the long term UTF-8 is no doubt the way to go, but we
> might not be quite there yet.

UTF-8 has been very well supported by all major Unix/Linux distibutions
for nearly a decade now, and has been the default locale in most new
installations since about 2005.

I strongly recommend that you use

  unset LC_ALL
  LANG=en_US.UTF-8        # or en_GB.UTF-8, ... outside the US
  LC_COLLATE=POSIX        # to avoid surprises with sorting order and globbing

With that, I hope that you will quickly find UTF-8 perfectly acceptable
now in source code.

UTF-8 has long become common practice in comments of many open-source
projects.

Markus

http://www.cl.cam.ac.uk/~mgk25/unicode.html#collate

-- 
Markus Kuhn, Computer Laboratory, University of Cambridge
http://www.cl.cam.ac.uk/~mgk25/ || CB3 0FD, Great Britain



More information about the tz mailing list