[UA-discuss] [UA-International] IDN-as-punycode-encoded-label in Baidu search engine results

Tan Tanaka, Dennis dtantanaka at verisign.com
Tue Nov 24 15:48:29 UTC 2015


Jiankang,

Thanks for your offer, that’d be extremely helpful. Let me know if we need to document the “bug” in a different way before submitting to Baidu. I’ll be happy to collaborate with you.

On a related note, I received second hand information on how Baidu crawls and indexes websites and how IDNs are treated different, which results in the behavior we just described below. Do you happen to know any of Baidu’s web-crawling practices?

Thanks again,
Dennis


From: Jiankang Yao [mailto:yaojk at cnnic.cn]
Sent: Tuesday, November 24, 2015 12:41 AM
To: Tan Tanaka, Dennis; ua-discuss
Cc: ua-international at icann.org
Subject: Re: [UA-International] IDN-as-punycode-encoded-label in Baidu search engine results


I can help to talk to baidu and forward your message to them.

________________________________
Jiankang Yao

From: Tan Tanaka, Dennis<mailto:dtantanaka at verisign.com>
Date: 2015-11-24 05:45
To: UA-discuss at icann.org<mailto:UA-discuss at icann.org>
CC: ua-international at icann.org<mailto:ua-international at icann.org>
Subject: [UA-International] IDN-as-punycode-encoded-label in Baidu search engine results
Often times I hear that IDNs are not indexed by certain search engines. While I know this is not true, the example below doesn’t help my case either (at least not 100%). Here is an example where the IDN I’m looking for is showing up in the first 5 search results on Baidu (see picture below). However, the string is displayed as the punycode-encoded label instead of the corresponding Chinese IDN (i.e. xn--ebr05n.com) .

Google and Yandex appear to work as expected. Bing didn’t display the domain name in the results (first two pages).

Is there someone interested (and with the language skills) in taking the action item to reach out to Baidu? This might be in the form of opening a bug ticket to explain the problem (IDN is displayed as punycode-encoded label. Example: xn--ebr05.com) and what the expected result should have been (IDN displayed as Chinese domain nam. Example: 墨刀.com).


[cid:image001.png at 01D126A2.1EAEBAA0]



Dennis Tan
Sr. Product Manager
Naming Services
DTanTanaka at Verisign.com<mailto:DTanTanaka at Verisign.com>

m: 571-246-7303 t: 703-948-4197
12061 Bluemont Way, Reston, VA 20190

VerisignInc.com<http://www.verisigninc.com/>

[Verisign™]



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mm.icann.org/pipermail/ua-discuss/attachments/20151124/04a41572/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 43511 bytes
Desc: image001.png
URL: <http://mm.icann.org/pipermail/ua-discuss/attachments/20151124/04a41572/image001.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image002.gif
Type: image/gif
Size: 131 bytes
Desc: image002.gif
URL: <http://mm.icann.org/pipermail/ua-discuss/attachments/20151124/04a41572/image002.gif>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image003.gif
Type: image/gif
Size: 3105 bytes
Desc: image003.gif
URL: <http://mm.icann.org/pipermail/ua-discuss/attachments/20151124/04a41572/image003.gif>


More information about the UA-discuss mailing list