[twitter-dev] want to get a frequency count of all words on twitter, 1 time/day

2009-12-02 Thread hydrodog
The Twitter API lets us collect the top 10 keywords, but what we want is far more words (100,000 perhaps?), and we only need them once per day. With a firehose we could do the work ourselves, but it seems clear that such a keyword list must already exist internally, so is there any chance to get

Re: [twitter-dev] want to get a frequency count of all words on twitter, 1 time/day

2009-12-02 Thread John Kalucki
Subscribe to a spritzer feed on the Streaming API: http://stream.twitter.com/1/statuses/sample.xml and you can work out your own word frequency distribution to a rather high precision. After you have done so, I bet you'll write your own Trending Topics algorithm. It's physically impossible not to.
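
As a rough illustration of the counting step suggested above, here is a minimal Python sketch. It assumes the status texts have already been pulled out of the sample stream and are available as an iterable of strings; connecting to the stream and parsing its payload are left out. The names `tokenize` and `word_frequencies`, the tokenizing regex, and the 100,000 default are illustrative choices, not anything defined by the Twitter API.

```python
import re
from collections import Counter

# Assumption: a "word" is a run of lowercase letters, digits, underscores,
# apostrophes, '#' or '@'. Real tweets may call for a more careful tokenizer.
WORD_RE = re.compile(r"[a-z0-9_#@']+")


def tokenize(text):
    """Lowercase a status text and split it into word tokens."""
    return WORD_RE.findall(text.lower())


def word_frequencies(texts, top_n=100_000):
    """Count words across an iterable of status texts.

    Returns up to top_n (word, count) pairs, most frequent first.
    """
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts.most_common(top_n)


if __name__ == "__main__":
    # Hypothetical stand-in for one day of sampled statuses.
    sample = [
        "The cat sat on the mat",
        "the dog chased the cat",
    ]
    for word, count in word_frequencies(sample, top_n=3):
        print(word, count)
```

Because the sample feed carries only a fraction of all statuses, the counts are best read as relative frequencies rather than absolute totals.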