<div dir="ltr">I&#39;d say those cons are pretty significant - I find it very significantly harder to read than the format I&#39;ve propsed. I&#39;m also confused by your &quot;pro&quot; that it doesn&#39;t depend on any implementation details, but it really <i>exposes</i> the implementation details in naming (&quot;isdst&quot; and &quot;gmtoff&quot; for example, along with the mysterious huge numeric values).<div>The aim is to have a format which is easy and natural to generate on <i>multiple</i> platforms, so that authors of other code parsing the data can validate against it. That would be a <i>very</i> unnatural format to generate from .NET with either Noda Time or TimeZoneInfo, or from Java 7, Java 8 or Joda Time.<br></div><div><br></div><div>I&#39;d certainly be in favour of my &quot;zicdump&quot; C# code being rewritten in C (and maybe even offered as a different format for zdump, based on a command-line flag) but I see very little benefit in adopting a format which doesn&#39;t seem to have been designed for the same purpose as the tzvalidate one.</div><div><br></div><div>Now having said all that, I&#39;m very happy to tweak the format (and have already done so based on earlier suggestions) - but I wouldn&#39;t want to use the zdump -v format just because it already exists, if it&#39;s not really fit for purpose.</div><div><br></div><div>Jon</div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On 27 April 2016 at 16:51, Random832 <span dir="ltr">&lt;<a href="mailto:random832@fastmail.com" target="_blank">random832@fastmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">I missed this the first time around...<br>

<span class=""><br>

&gt; On 11 July 2015 at 11:35, Jon Skeet &lt;<a href="mailto:skeet@pobox.com">skeet@pobox.com</a>&gt; wrote:<br>

&gt;<br>

</span>&gt; &gt; Background: I&#39;m the primary developer for Noda Time &lt;<a href="http://nodatime.org" rel="noreferrer" target="_blank">http://nodatime.org</a>&gt; which<br>

<span class="">&gt; &gt; consumes the tz data. I&#39;m currently refactoring the code to do this... and<br>

&gt; &gt; I&#39;ve come across some code (originally ported from Joda Time) which I now<br>

&gt; &gt; understand in terms of what it&#39;s doing, but not exactly why.<br>

&gt; &gt;<br>

&gt; &gt; For a little while now, the Noda Time source repo has included a text<br>

&gt; &gt; dump file<br>

</span>&gt; &gt; &lt;<a href="https://github.com/nodatime/nodatime/blob/master/src/NodaTime.Test/TestData/tzdb-dump.txt" rel="noreferrer" target="_blank">https://github.com/nodatime/nodatime/blob/master/src/NodaTime.Test/TestData/tzdb-dump.txt</a>&gt;,<br>

<span class="">&gt; &gt; containing a text dump of every transition (up to 2100, at the moment) for<br>

&gt; &gt; every time zone. It looks like this, picking just one example:<br>

&gt; &gt;<br>

&gt; &gt; Zone: Africa/Maseru<br>

&gt; &gt; LMT: [StartOfTime, 1892-02-07T22:08:00Z) +01:52 (+00)<br>

&gt; &gt; SAST: [1892-02-07T22:08:00Z, 1903-02-28T22:30:00Z) +01:30 (+00)<br>

&gt; &gt; SAST: [1903-02-28T22:30:00Z, 1942-09-20T00:00:00Z) +02 (+00)<br>

&gt; &gt; SAST: [1942-09-20T00:00:00Z, 1943-03-20T23:00:00Z) +03 (+01)<br>

&gt; &gt; SAST: [1943-03-20T23:00:00Z, 1943-09-19T00:00:00Z) +02 (+00)<br>

&gt; &gt; SAST: [1943-09-19T00:00:00Z, 1944-03-18T23:00:00Z) +03 (+01)<br>

&gt; &gt; SAST: [1944-03-18T23:00:00Z, EndOfTime) +02 (+00)<br>

<br>

</span>...<br>

<span class=""><br>

&gt; &gt; Any thoughts? If the feeling is broadly positive, the next step would be<br>

&gt; &gt; to nail down the text format, then find a willing victim/volunteer to write<br>

&gt; &gt; the C code. (You really don&#39;t want me writing C...)<br>

<br>

</span>What&#39;s wrong with zdump&#39;s output format?<br>

<br>

$ zdump -v Africa/Maseru<br>

Africa/Maseru  -9223372036854775808 = NULL<br>

Africa/Maseru  -9223372036854689408 = NULL<br>

Africa/Maseru  Sun Feb  7 22:07:59 1892 UTC = Sun Feb  7 23:59:59 1892<br>

LMT isdst=0 gmtoff=6720<br>

Africa/Maseru  Sun Feb  7 22:08:00 1892 UTC = Sun Feb  7 23:38:00 1892<br>

SAST isdst=0 gmtoff=5400<br>

Africa/Maseru  Sat Feb 28 22:29:59 1903 UTC = Sat Feb 28 23:59:59 1903<br>

SAST isdst=0 gmtoff=5400<br>

Africa/Maseru  Sat Feb 28 22:30:00 1903 UTC = Sun Mar  1 00:30:00 1903<br>

SAST isdst=0 gmtoff=7200<br>

Africa/Maseru  Sat Sep 19 23:59:59 1942 UTC = Sun Sep 20 01:59:59 1942<br>

SAST isdst=0 gmtoff=7200<br>

Africa/Maseru  Sun Sep 20 00:00:00 1942 UTC = Sun Sep 20 03:00:00 1942<br>

SAST isdst=1 gmtoff=10800<br>

Africa/Maseru  Sat Mar 20 22:59:59 1943 UTC = Sun Mar 21 01:59:59 1943<br>

SAST isdst=1 gmtoff=10800<br>

Africa/Maseru  Sat Mar 20 23:00:00 1943 UTC = Sun Mar 21 01:00:00 1943<br>

SAST isdst=0 gmtoff=7200<br>

Africa/Maseru  Sat Sep 18 23:59:59 1943 UTC = Sun Sep 19 01:59:59 1943<br>

SAST isdst=0 gmtoff=7200<br>

Africa/Maseru  Sun Sep 19 00:00:00 1943 UTC = Sun Sep 19 03:00:00 1943<br>

SAST isdst=1 gmtoff=10800<br>

Africa/Maseru  Sat Mar 18 22:59:59 1944 UTC = Sun Mar 19 01:59:59 1944<br>

SAST isdst=1 gmtoff=10800<br>

Africa/Maseru  Sat Mar 18 23:00:00 1944 UTC = Sun Mar 19 01:00:00 1944<br>

SAST isdst=0 gmtoff=7200<br>

Africa/Maseru  9223372036854689407 = NULL<br>

Africa/Maseru  9223372036854775807 = NULL<br>

<br>

Cons:<br>

- A bit verbose<br>

- technically uses instants (from before and on each transition) rather<br>

than spans.<br>

- The NULLs are a bit mysterious<br>

- I&#39;m personally not sure *exactly* how it finds the transitions, and in<br>

particular I&#39;m not sure if it will reliably find multiple transitions<br>

per day<br>

<br>

Pros:<br>

- Already exists<br>

- Is already written in C, and already installed on many systems<br>

- Does not depend on any implementation internals<br>

</blockquote></div><br></div>