John,
The reason for viewing an aggregate count rather than individual
methods is to have a much more sensitive way of detecting a new
topic. Sampling will only give us a good representative sample above
the threshhold, once it's already quite popular. We'd like to be able
to follow a topic
The twitter API allows us to collect the top 10 keywords, but what we
want is a lot of words (100,000 perhaps?) but only once per day.
Obviously, with a firehose, we could do the work ourselves, but it
seems obvious that internally, such a keyword list must exist, so is
there any chance to get