Sampling should be plenty for demo purposes. Learning language models by using the geo code as a starting point sounds like a quick thing to try.
Clustering with the tags you mentioned as a seed would be very interesting as well.

On Wed, Jan 20, 2010 at 5:16 PM, Olivier Grisel <[email protected]> wrote:

> > 1. access to the data, although I'm sure the ASF could work something out
> > here
>
> Firehose (the live complete Twitter stream) is going to be open to the
> public this year. In the meantime it is possible to
> gain access to a sample stream and to perform ad hoc search queries on
> specific terms or user profiles.

--
Ted Dunning, CTO
DeepDyve
