Sampling should be plenty for demo purposes.

Learning language models by using the geo code as a starting point sounds
like a quick thing to try.

Clustering with the tags you mentioned as a seed would be very interesting
as well.

On Wed, Jan 20, 2010 at 5:16 PM, Olivier Grisel <[email protected]>wrote:

> > 1. access to the data, although I'm sure the ASF could work something out
> > here
>
> Firehose (the live complete twitter stream) is going to be open to the
> public this year. In the mean time the mean time it is possible to
> gain access to a sample stream and to perform adhoc search queries on
> specific terms or user profiles.
>



-- 
Ted Dunning, CTO
DeepDyve

Reply via email to