<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
Hello Again All,<br>
<br>
Please find attached a new version of my spreadsheet, which now
includes about 15 or so missing comments. All of them were
templated.<br>
<br>
Graeme<br>
<br>
<div class="moz-cite-prefix">On 2015-07-13 11:07 PM, Graeme Bunton
wrote:<br>
</div>
<blockquote cite="mid:55A47CEB.9010308@tucows.com" type="cite">Hi
All,
<br>
<br>
I had a kind developer at Tucows 'screen scrape' all of the PPSAI
public comments. This means they wrote a program that essentially
visited all*of the comments submitted and captured:
<br>
-the sender
<br>
-the subject
<br>
-the body of the message
<br>
- a url for any attachments
<br>
- a url for the comment itself online
<br>
<br>
To those fields I've added:
<br>
- Has Attachment (Y/N) - this allows for easy filtering of
comments with attachments
<br>
- NameCheap (Y/N) - this flag is generated if the message body
contains the words "regardless of whether the request comes from a
private individual" which comes from the templated namecheap
comments.**
<br>
-Word count - allows for sorting by comment length
<br>
<br>
Screenscraping is never exact, and it has a tough time with some
formatting. By and large though it's pretty good and I've found it
useful so far for triaging and prioritizing comments.
<br>
<br>
Caveats:
<br>
* I know it's missing about 15 or so comments, I've figured out a
way to identify which are missing and will send those along
tomorrow.
<br>
** There are many namecheap comments where the sender chose to
write their own text and therefore the above phrase is not
included, these don't have the Y flag. Similarly, many with the
flag will include extra content the sender chose to add. These
can be identified by applying a filter on the namecheap column and
then sorting by wordcount. The above phrase was chosen because it
was long enough to be unlikely to show up in in other comments.
This is obviously not perfect at identify those comments.
<br>
<br>
With a bit of excel expertise you should be able to filter and
sort the submitted comments as you see fit.
<br>
<br>
We have an obligation to read what's been submitted, and I hope
you find the attached makes reading the comments easier, and that
it's helpful in understanding what the public is telling us.
<br>
<br>
Graeme
<br>
<br>
(Also, apologies to ICANN for the punishment we gave their
webservers while testing and scraping the comments)
<br>
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Gnso-ppsai-pdp-wg mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Gnso-ppsai-pdp-wg@icann.org">Gnso-ppsai-pdp-wg@icann.org</a>
<a class="moz-txt-link-freetext" href="https://mm.icann.org/mailman/listinfo/gnso-ppsai-pdp-wg">https://mm.icann.org/mailman/listinfo/gnso-ppsai-pdp-wg</a></pre>
</blockquote>
<br>
<pre class="moz-signature" cols="72">--
_________________________
Graeme Bunton
Manager, Management Information Systems
Manager, Public Policy
Tucows Inc.
PH: 416 535 0123 ext 1634</pre>
</body>
</html>