<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Making twitter multilingual with a hack of the Google Translation API</title>
	<atom:link href="http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/</link>
	<description>Chris Heilmann - Accessibility, Web Development and Pragmatism - can talk, will travel</description>
	<pubDate>Fri, 05 Dec 2008 10:47:39 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Alex Francis</title>
		<link>http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-8600</link>
		<dc:creator>Alex Francis</dc:creator>
		<pubDate>Fri, 14 Nov 2008 10:13:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-8600</guid>
		<description>Hi Chris

Ref "However, if you want to use this in a loop you are out of luck" - I *think* you can use a closure to get around this. My js is not top-notch so sorry if I'm wrong, but something like the following seems to work ok:

        function requestOne(text, id) {
            google.language.detect(text, function(result) {
              if (!result.error) {
                var language = 'unknown';
                for (l in google.language.Languages) {
                  if (google.language.Languages[l] == result.language) {
                    language = l;
                    break;
                  }
                }
                var container = document.getElementById(id);
                container.innerHTML = text + " is: " + language + "";
              }
            });
        }
        
        for (var i = 0; i &#60; texts.length; i++) {
            this_text = texts[i];
            this_id = ids[i];
            requestOne( this_text, this_id );
        }

Am I hallucinating?

Alex</description>
		<content:encoded><![CDATA[<p>Hi Chris</p>
<p>Ref &#8220;However, if you want to use this in a loop you are out of luck&#8221; - I <strong>think</strong> you can use a closure to get around this. My js is not top-notch so sorry if I&#8217;m wrong, but something like the following seems to work ok:</p>
<p>        function requestOne(text, id) {<br />
            google.language.detect(text, function(result) {<br />
              if (!result.error) {<br />
                var language = &#8216;unknown&#8217;;<br />
                for (l in google.language.Languages) {<br />
                  if (google.language.Languages[l] == result.language) {<br />
                    language = l;<br />
                    break;<br />
                  }<br />
                }<br />
                var container = document.getElementById(id);<br />
                container.innerHTML = text + &#8221; is: &#8221; + language + &#8220;&#8221;;<br />
              }<br />
            });<br />
        }</p>
<p>        for (var i = 0; i &lt; texts.length; i++) {<br />
            this_text = texts[i];<br />
            this_id = ids[i];<br />
            requestOne( this_text, this_id );<br />
        }</p>
<p>Am I hallucinating?</p>
<p>Alex</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ricardo Niederberger Cabral</title>
		<link>http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-8294</link>
		<dc:creator>Ricardo Niederberger Cabral</dc:creator>
		<pubDate>Sun, 21 Sep 2008 20:49:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-8294</guid>
		<description>@Mark: I've been collecting statistics on the &lt;a href="http://blog.isnotworking.com/2008/09/language-most-spoken-on-twitter.html" rel="nofollow"&gt;most used language at Twitter&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>@Mark: I&#8217;ve been collecting statistics on the <a href="http://blog.isnotworking.com/2008/09/language-most-spoken-on-twitter.html" rel="nofollow">most used language at Twitter</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Neigh</title>
		<link>http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-7502</link>
		<dc:creator>Mark Neigh</dc:creator>
		<pubDate>Fri, 18 Apr 2008 17:44:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-7502</guid>
		<description>Hello, This is very cool. I am working on a paper about mobile social media and its effects on global English. Do you have any stats on %s of languages being used on Twitter? Or any other interesting bits you could share? </description>
		<content:encoded><![CDATA[<p>Hello, This is very cool. I am working on a paper about mobile social media and its effects on global English. Do you have any stats on %s of languages being used on Twitter? Or any other interesting bits you could share? </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard</title>
		<link>http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-7379</link>
		<dc:creator>Richard</dc:creator>
		<pubDate>Mon, 31 Mar 2008 19:09:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.wait-till-i.com/2008/03/31/making-twitter-multilingual-with-a-hack-of-the-google-translation-api/#comment-7379</guid>
		<description>Nice work. Definitely very cool.

It incorrectly identified Chinese as Italian, and Japanese as Slovak (I think; sk?) though. Which seems like pretty strange mistake to make, especially having heard that Google's translator is quite good at translating Chinese (and Arabic) in particular. Then again, all that code means nothing to me, though (the language bit is what interested me) so it might be some limitation with how the API parses the text? Or that might not even make any sense. It's a shame the demo page doesn't output the confidence values too. I'd be interested to see that.

Another potential issue might be correctly identifying tweets in one language, that happen to contain words of another language. For example, it showed this tweet: "GIF Sundsvall vs. Helsingborgs IF (Allsvenskan): Match has finished. Result: 0:3" (URL from tweet removed, lest I annoy your spam filters) as Swedish, despite obviously being English.

The code means very little to me, but the idea is definitely awesome. 

It's a shame automatic machine translation comes with so many inherent downfalls. Though I think Google's is one of the better ones, with quite a shallow approach to disambiguation, which would be the biggest hurdle (if you even intended to advance it to actually translating the given text).</description>
		<content:encoded><![CDATA[<p>Nice work. Definitely very cool.</p>
<p>It incorrectly identified Chinese as Italian, and Japanese as Slovak (I think; sk?) though. Which seems like pretty strange mistake to make, especially having heard that Google&#8217;s translator is quite good at translating Chinese (and Arabic) in particular. Then again, all that code means nothing to me, though (the language bit is what interested me) so it might be some limitation with how the <span class="caps">API </span>parses the text? Or that might not even make any sense. It&#8217;s a shame the demo page doesn&#8217;t output the confidence values too. I&#8217;d be interested to see that.</p>
<p>Another potential issue might be correctly identifying tweets in one language, that happen to contain words of another language. For example, it showed this tweet: &#8220;GIF Sundsvall vs. Helsingborgs IF (Allsvenskan): Match has finished. Result: 0:3&#8243; (URL from tweet removed, lest I annoy your spam filters) as Swedish, despite obviously being English.</p>
<p>The code means very little to me, but the idea is definitely awesome. </p>
<p>It&#8217;s a shame automatic machine translation comes with so many inherent downfalls. Though I think Google&#8217;s is one of the better ones, with quite a shallow approach to disambiguation, which would be the biggest hurdle (if you even intended to advance it to actually translating the given text).</p>
]]></content:encoded>
	</item>
</channel>
</rss>
