<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google translate strikes again</title>
	<atom:link href="http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/</link>
	<description>a linguist without a language</description>
	<lastBuildDate>Fri, 11 Jun 2010 04:48:25 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: jangari</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6327</link>
		<dc:creator>jangari</dc:creator>
		<pubDate>Mon, 16 Jun 2008 23:31:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6327</guid>
		<description>Yeah Wamut, good point.

I guess Korean is the only language (found so far) for which Rudd Wine Shop have been successful...

Maybe there&#039;s far too much material in Cantonese and Mandarin that provides better translations for &quot;Rudd&quot; and as such, the system won&#039;t be so amenable to suggestions from the public.</description>
		<content:encoded><![CDATA[<p>Yeah Wamut, good point.</p>
<p>I guess Korean is the only language (found so far) for which Rudd Wine Shop have been successful&#8230;</p>
<p>Maybe there&#8217;s far too much material in Cantonese and Mandarin that provides better translations for &#8220;Rudd&#8221; and as such, the system won&#8217;t be so amenable to suggestions from the public.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: wamut</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6308</link>
		<dc:creator>wamut</dc:creator>
		<pubDate>Mon, 16 Jun 2008 08:52:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6308</guid>
		<description>Then why only for Korean?</description>
		<content:encoded><![CDATA[<p>Then why only for Korean?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jangari</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6296</link>
		<dc:creator>jangari</dc:creator>
		<pubDate>Sun, 15 Jun 2008 23:23:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6296</guid>
		<description>K. Smith and Simon, on reflection I think it&#039;s something else entirely. And unfortunately it&#039;s probably much more mundane than either covert epenthetic advertisements or sophisticated corpus translation technologies gone wrong. But thanks for the suggestions all the same.

Firstly, Korean is not spoken in Hong Kong; not commonly anyway. So I doubt Google Translate would insert this advert for a translation from English to Korean, but not for a translation from English to Mandarin or Cantonese. Furthermore, as Simon points out, Google always identify paid content as advertising, so this should be no different.

Secondly, for the overgeneralised corpus technique of translation to be the culprit, you&#039;d have to say that there is some English corpus text somewhere that has a corresponding Korean text, wherein, if the English version has the string &lt;i&gt;Rudd&lt;/i&gt;, then the Korean version has the string &lt;i&gt;Rudd 전화 +852 2907 2112&lt;/i&gt;. I think this is unlikely because, first, surely the English text would similarly have the phone number of the wine shop - it&#039;s more of an English speaking region than Korean - and second, Google&#039;s sophisticated algorithms would probably throw up a red flag when a single word in English corresponds in Korean with the very same word, two Korean words, and a phone number complete with international telephony prefix.

What I think is going on is much less exciting. Notice at the bottom of the Google Translate page is a javascript link reading &lt;i&gt;Suggest a better translation&lt;/i&gt;. Clicking it brings up a text box where you type in a better translation that the one that Google is giving you. Next to the box is this description:
&lt;blockquote&gt;We&#039;ll use your suggestion to improve translation quality in future updates to our system.&lt;/blockquote&gt;
Would Google sift through probably millions of translation suggestions every day and make a judgement for each? Not likely. I think what is likely is that if the exact same string gets suggested as a translation for some other string, a certain number of times, then Google&#039;s algorithms will accept it.

Ergo, Rudd Wine Shop has diligently and deliberately inserted this ad into Google&#039;s highly fallible system.</description>
		<content:encoded><![CDATA[<p>K. Smith and Simon, on reflection I think it&#8217;s something else entirely. And unfortunately it&#8217;s probably much more mundane than either covert epenthetic advertisements or sophisticated corpus translation technologies gone wrong. But thanks for the suggestions all the same.</p>
<p>Firstly, Korean is not spoken in Hong Kong; not commonly anyway. So I doubt Google Translate would insert this advert for a translation from English to Korean, but not for a translation from English to Mandarin or Cantonese. Furthermore, as Simon points out, Google always identify paid content as advertising, so this should be no different.</p>
<p>Secondly, for the overgeneralised corpus technique of translation to be the culprit, you&#8217;d have to say that there is some English corpus text somewhere that has a corresponding Korean text, wherein, if the English version has the string <i>Rudd</i>, then the Korean version has the string <i>Rudd 전화 +852 2907 2112</i>. I think this is unlikely because, first, surely the English text would similarly have the phone number of the wine shop &#8211; it&#8217;s more of an English speaking region than Korean &#8211; and second, Google&#8217;s sophisticated algorithms would probably throw up a red flag when a single word in English corresponds in Korean with the very same word, two Korean words, and a phone number complete with international telephony prefix.</p>
<p>What I think is going on is much less exciting. Notice at the bottom of the Google Translate page is a javascript link reading <i>Suggest a better translation</i>. Clicking it brings up a text box where you type in a better translation that the one that Google is giving you. Next to the box is this description:</p>
<blockquote><p>We&#8217;ll use your suggestion to improve translation quality in future updates to our system.</p></blockquote>
<p>Would Google sift through probably millions of translation suggestions every day and make a judgement for each? Not likely. I think what is likely is that if the exact same string gets suggested as a translation for some other string, a certain number of times, then Google&#8217;s algorithms will accept it.</p>
<p>Ergo, Rudd Wine Shop has diligently and deliberately inserted this ad into Google&#8217;s highly fallible system.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6247</link>
		<dc:creator>Simon</dc:creator>
		<pubDate>Sat, 14 Jun 2008 09:32:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6247</guid>
		<description>An explanation might be that we have encountered a limitation in Google&#039;s implementation of computational translation. My limited understanding of Google Translate is that it looks at two parallel texts each in a different language and then tries to generalise from there when we come along with our translation request. So, to speculate wildly, it may be that they have an English text which refers to the Rudd wine shop without phone number and a parallel Korean text which includes the phone number hence the mistake.

This may be more likely than Google inserting advertisements on the sly as generally they are required and do identify that which is advertisement and that which is not quite clearly.</description>
		<content:encoded><![CDATA[<p>An explanation might be that we have encountered a limitation in Google&#8217;s implementation of computational translation. My limited understanding of Google Translate is that it looks at two parallel texts each in a different language and then tries to generalise from there when we come along with our translation request. So, to speculate wildly, it may be that they have an English text which refers to the Rudd wine shop without phone number and a parallel Korean text which includes the phone number hence the mistake.</p>
<p>This may be more likely than Google inserting advertisements on the sly as generally they are required and do identify that which is advertisement and that which is not quite clearly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: K. Smith</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6217</link>
		<dc:creator>K. Smith</dc:creator>
		<pubDate>Fri, 13 Jun 2008 14:55:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6217</guid>
		<description>I think I figured out some of what&#039;s going on:

In your original article, you have the following quote: &quot;And since the text of Kevin Rudd’s first parliamentary act as Prime Minister...&quot;.  I noticed that it contained the word &quot;Rudd&quot;, which you had mentioned was &#039;attached&#039;, so to speak, to the phone number, and which also happens to be the name of the wine shop the phone number goes to.  So I decided to type it into Google Translate.  When you translate just the word &quot;Rudd&quot; (English to Korean), it returns &quot;Rudd 전화 +852 2907 2112&quot;.  So I think this is where the translation came from.

As for why it did this, it reminds me of the Google ads in Gmail that try to match what is contained in your current email.  Since there is a Rudd wine shop in Hong Kong, and they were translating an entire webpage into Korean, maybe it tried to create a related link and/or advertise for something (?) and they ended up with that number.  I also noticed that this does not occur if you try to translate &quot;Rudd&quot; into any other languages, so it seems the translated language and the regions it is often spoken in come into play somehow.  Maybe you could try translating other words from a given language into English which would have an associated phone number with them?  I tried &quot;Forzieri&quot; from Italian to English, but it didn&#039;t come up with anything interesting.

Anyway, this reminds me of those short murder mysteries... following the lead until you find out &quot;who done it&quot;.  =)</description>
		<content:encoded><![CDATA[<p>I think I figured out some of what&#8217;s going on:</p>
<p>In your original article, you have the following quote: &#8220;And since the text of Kevin Rudd’s first parliamentary act as Prime Minister&#8230;&#8221;.  I noticed that it contained the word &#8220;Rudd&#8221;, which you had mentioned was &#8216;attached&#8217;, so to speak, to the phone number, and which also happens to be the name of the wine shop the phone number goes to.  So I decided to type it into Google Translate.  When you translate just the word &#8220;Rudd&#8221; (English to Korean), it returns &#8220;Rudd 전화 +852 2907 2112&#8243;.  So I think this is where the translation came from.</p>
<p>As for why it did this, it reminds me of the Google ads in Gmail that try to match what is contained in your current email.  Since there is a Rudd wine shop in Hong Kong, and they were translating an entire webpage into Korean, maybe it tried to create a related link and/or advertise for something (?) and they ended up with that number.  I also noticed that this does not occur if you try to translate &#8220;Rudd&#8221; into any other languages, so it seems the translated language and the regions it is often spoken in come into play somehow.  Maybe you could try translating other words from a given language into English which would have an associated phone number with them?  I tried &#8220;Forzieri&#8221; from Italian to English, but it didn&#8217;t come up with anything interesting.</p>
<p>Anyway, this reminds me of those short murder mysteries&#8230; following the lead until you find out &#8220;who done it&#8221;.  =)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jangari</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6202</link>
		<dc:creator>jangari</dc:creator>
		<pubDate>Fri, 13 Jun 2008 10:03:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6202</guid>
		<description>Did I mention that the wine shop in question is actually called &lt;i&gt;Rudd Wine Shop&lt;/i&gt;?

Is this a new form of advertising? By stealth on Google Translate?

The plot thickens.</description>
		<content:encoded><![CDATA[<p>Did I mention that the wine shop in question is actually called <i>Rudd Wine Shop</i>?</p>
<p>Is this a new form of advertising? By stealth on Google Translate?</p>
<p>The plot thickens.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jangari</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6201</link>
		<dc:creator>jangari</dc:creator>
		<pubDate>Fri, 13 Jun 2008 10:00:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6201</guid>
		<description>Ah Claire, you&#039;re too quick!

My housemate just rang it and got the wine shop. I was about to post an update, but you beat me to it.

But we&#039;s getting some fine wine from Hong Kong!</description>
		<content:encoded><![CDATA[<p>Ah Claire, you&#8217;re too quick!</p>
<p>My housemate just rang it and got the wine shop. I was about to post an update, but you beat me to it.</p>
<p>But we&#8217;s getting some fine wine from Hong Kong!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Machka</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6199</link>
		<dc:creator>Machka</dc:creator>
		<pubDate>Fri, 13 Jun 2008 09:45:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6199</guid>
		<description>Mmmm Niagra wines....*drools*</description>
		<content:encoded><![CDATA[<p>Mmmm Niagra wines&#8230;.*drools*</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Claire</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6198</link>
		<dc:creator>Claire</dc:creator>
		<pubDate>Fri, 13 Jun 2008 09:43:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6198</guid>
		<description>What&#039;s even more intriguing is that it appears to be the phone number for a Hong Kong wine shop that sells &#039;fine Niagara wines&#039; ...

And what&#039;s more intriguing still is that your blog thinks I&#039;ve already posted this comment when I haven&#039;t pressed submit yet...</description>
		<content:encoded><![CDATA[<p>What&#8217;s even more intriguing is that it appears to be the phone number for a Hong Kong wine shop that sells &#8216;fine Niagara wines&#8217; &#8230;</p>
<p>And what&#8217;s more intriguing still is that your blog thinks I&#8217;ve already posted this comment when I haven&#8217;t pressed submit yet&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Claire</title>
		<link>http://www.matjjin-nehen.com/2008/06/13/google-translate-strikes-again/comment-page-1/#comment-6197</link>
		<dc:creator>Claire</dc:creator>
		<pubDate>Fri, 13 Jun 2008 09:41:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.matjjin-nehen.com/?p=219#comment-6197</guid>
		<description>What&#039;s even more intriguing is that it appears to be the phone number for a Hong Kong wine shop that sells &#039;fine Niagara wines&#039; ...</description>
		<content:encoded><![CDATA[<p>What&#8217;s even more intriguing is that it appears to be the phone number for a Hong Kong wine shop that sells &#8216;fine Niagara wines&#8217; &#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>
