[UA-discuss] Spam Filtering
Hazem Hezzah
hhezzah.las at gmail.com
Thu Feb 9 12:18:13 UTC 2017
Just reminding you that non-ASCII characters are included in most European languages (umlaut, accent....etc.) using Latin scripts.
So, consider that when you take your decision to ban those emails too.
Regards,
Hazem Hezzah
-----Original Message-----
From: ua-discuss-bounces at icann.org [mailto:ua-discuss-bounces at icann.org] On Behalf Of Asmus Freytag
Sent: Wednesday, 08 February, 2017 19:57
To: ua-discuss at icann.org
Subject: Re: [UA-discuss] Spam Filtering
On 2/8/2017 9:31 AM, Andrew Sullivan wrote:
> On Wed, Feb 08, 2017 at 04:47:52PM +0000, Andre Schappo wrote:
>> Recently I tested sending email from my DataMail 小山@电邮.在线 to some
>> people on gmail. Mostly they report that my email to them goes into
>> their gmail Spam/Junk folder. I deduce that the gmail spam filtering
>> software is treating chinese/unicode email addresses less favourably
>> than ascii email addresses.
> That seems like a leap. How many of those people speak Chinese
> natively?
>
> Most spam systems are Bayesian and have been taught to look for
> patterns that match other spam. Everyone who has taught gmail to
> recognize "Han in the headers == spam" will have to un-teach it that.
> I don't know about you, but since I'm not a native user of Han
> characters I have certainly taught my spam filters such rules of
> thumb.
>
> A
Precisely. As a forum admin, I would cheerfully ban all non-ASCII e-mails for registration, because my user base is not world-wide, but the spammers are. Only reason I am not doing that today, is that the software doesn't know about non-ascii e-mails yet :)
A./
More information about the UA-discuss
mailing list