[twitter-dev] Re: Quick hack: using Twitter with Yahoo Placemaker to geolocate tweets

Christian Heilmann Thu, 28 May 2009 00:50:30 -0700


Brendan O'Connor wrote:

On Wed, May 27, 2009 at 8:08 PM, Nancy M <[email protected]> wrote:
    I do like the maps, but 50% error -- you would not possibly get on an
    airplane with that kind of error rate, would you?  And I don't think
    I'd want to make decisions about my demographics on something with
    that error rate either.   Why not take the IPs and bounce them against
    whois or something?
This app isn't about that; it's about what places a person is talking about. You can't use their IPs; the point is to identify locations in the text of their tweets. (I asked whether the app was looking at the author's location to help disambiguate because I thought it could be used to improve accuracy; but this is hypothetical.)

Thanks, that is exactly the point, as explained in the only text on the page:

"TweetLocations analyses twitter updates and checks if they contain any geographical locations. Instead of relying on the Twitter location in your user profile TweetLocations finds the locations you talked about."

:-)

In defense of error rates, if the task is just to get a sense of what regions of the world someone tends to talk about, then something like a 10% or 20% error rate might be ok; and it was lower than that for Chris's and some of the other example twitter users the app was suggesting.

Well, error rates are a good question. How would a dumb computer know what the context is in 140 characters? Notice that if you use "My name is Jack London and I live in Toronto" PlaceMaker only shows Toronto, which is impressive!

But here's one case where errors are very bad. One thing I thought was great about the map UI was that you can see a flag all by itself out in Mexico or something, be curious what the person is saying about Mexico, and click on it to see the message. If errors tend to be geographic outliers then they really hurt this use case, since geographic outliers are easy to see and are interesting simply because they are unusual ("oh, Brendan's always boring and talks about California, but look, one time he talked about Switzerland! oops, not really.")

How could I work around that?
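One possible way to soften the outlier problem described above, purely as a rough sketch: flag places that sit far away from the area a user normally talks about, so the map can mark them as "unconfirmed" instead of presenting them like any other match. The function names, the coordinate-wise median as the "usual area", and the 3000 km threshold are all assumptions made for illustration; nothing here is part of TweetLocations or Placemaker.

```python
import math
from statistics import median

EARTH_RADIUS_KM = 6371.0


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def flag_outliers(points, threshold_km=3000.0):
    """Return one boolean per point: True if it lies far from the user's usual area.

    The usual area is approximated by the coordinate-wise median of all points.
    This is a crude assumption: it misbehaves near the antimeridian and for
    users who really do talk about places all over the world.
    """
    if len(points) < 3:
        # Too little data to say what is "usual" for this user.
        return [False] * len(points)
    centre = (median(p[0] for p in points), median(p[1] for p in points))
    return [haversine_km(p, centre) > threshold_km for p in points]


# Hypothetical example: three Californian places and one Swiss one.
places = [(37.77, -122.42), (37.80, -122.27), (34.05, -118.24), (46.80, 8.20)]
flags = flag_outliers(places)
```

A flagged place is not necessarily wrong (people do occasionally talk about Switzerland), so it seems safer to render it differently than to drop it.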

I think the issue with some of the errors the Yahoo Placemaker thing was making with my tweets is that it's not integrating prior information very well about how commonly those locations are talked about. I think "scala" is only rarely used to mean the Switzerland canton, but is quite often used to mean the programming language; but Placemaker is happy to use a rare, unlikely sense of "scala" here.

Well, PlaceMaker is a DB of geographical locations (which you can even download - http://developer.yahoo.com/geo/geoplanet/data/) and doesn't compare with a DB of programming languages. It would be interesting to see how it differs from the other (less open) services out there. Maybe I'll use Simon Willison's geocoders and only return if there is a match.
http://github.com/simonw/geocoders/tree/master
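A minimal sketch of what "only return if there is a match" could look like, combined with a small stoplist for words that usually mean something other than a place. Everything named here is hypothetical: `second_geocoder` stands in for any function that maps a place name to coordinates or `None` (it is not the actual API of the geocoders library), and `AMBIGUOUS_TERMS` is an illustrative hand-made list, not data from Placemaker.

```python
from typing import Callable, Iterable, List, Optional, Set, Tuple

Coord = Tuple[float, float]

# Hypothetical stoplist: words that are far more often used in a non-place sense.
AMBIGUOUS_TERMS: Set[str] = {"scala"}


def confirmed_places(
    names: Iterable[str],
    second_geocoder: Callable[[str], Optional[Coord]],
    ambiguous: Set[str] = AMBIGUOUS_TERMS,
) -> List[Tuple[str, Coord]]:
    """Keep only the candidate place names that a second geocoder also resolves.

    `names` are the place names the first service extracted from a tweet.
    Names on the stoplist are dropped before the second lookup is attempted.
    """
    kept = []
    for name in names:
        if name.lower() in ambiguous:
            continue  # likely not meant as a place
        coord = second_geocoder(name)
        if coord is not None:
            kept.append((name, coord))
    return kept


# Stand-in for a real geocoder, so the example runs without network access.
fake_lookup = {"Toronto": (43.65, -79.38)}.get

result = confirmed_places(["Toronto", "Scala", "Nowhereville"], fake_lookup)
```

Requiring agreement between two services trades recall for precision: fewer flags on the map, but fewer surprising ones.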



regards
Chris
