I'd like to get somewhere around 100GB of tweets. It doesn't matter where they are from, when they were sent, etc. I'd just like to have a relatively large collection of data to use as assignment data for a class I'm teaching that uses Hadoop.
Is such a collection available for download anywhere, or is there an existing program I could use to simply record twitter data for some period of time? (I've heard about both the firehose and the streaming API, but can't seem to find anything that is ready to run with that for this particular task....but I might not know where to look). Cordially, Ted --- Ted Pedersen http://www.d.umn.edu/~tpederse -- Twitter developer documentation and resources: http://dev.twitter.com/doc API updates via Twitter: http://twitter.com/twitterapi Issues/Enhancements Tracker: http://code.google.com/p/twitter-api/issues/list Change your membership to this group: http://groups.google.com/group/twitter-development-talk