Uh ... then again, http://blog.twitter.com/2010/04/tweet-preservation.html
;-)

On Apr 15, 1:04 am, zn...@comcast.net wrote:
> ----- "Philip (flip) Kromer" <f...@infochimps.org> wrote:
>
> > Hi all,
>
> > I'm pleased to announce that Infochimps is making datasets from our
> > massive scrape of the Twitter corpus available for Chirp Hack day
> > devs.
>
> > There's a big opportunity for apps that draw on the historical record
> > and *structure* of twitter -- apps that require a global perspective
> > and intense computation. The following are available to mash up
> > against other datasets from infochimps.org or even just to
> > bootstrap-seed the database for your Hack Day application. We also
> > have a 30-machine cluster up to do further extractions, so if you have
> > something really interesting you'd like to pull please let me know.
>
> > * Reputation Metrics from Reply and Follow graphs: Uses an algorithm
> >   similar to pagerank to derive reputation, one using the a_follows_b
> >   graph and one using the a_replies_b graph
> > * Reply/retweet/mention graph: Every observed reply, retweet, or
> >   mention seen in a 1.6B-tweet sample (about 15% of historical
> >   record): a_[rel]_b, user_a_id, user_b_id, tweet_id
> > * Twitter Users by Background Color: The number of users with each
> >   background color: color code, user count
> > * Twitter Users by Friends Count: The number of users with a given
> >   number of friends: number of friends, user count
> > * Twitter Users by Followers Count: The number of users with a given
> >   number of followers: number of followers, user count
> > * Twitter Users by Created At: The number of users whose accounts
> >   were created in a given month/day/hour, along with the earliest
> >   seen ID in that hour: timestamp to month/day/hour, user count
> > * Smileys: Smiley faces with user, date, tweet_id
> > * Hashtags: Hashtags with user, date, tweet_id
> > * TweetUrl: URLs with user, date, tweet_id
> > * Twitter Users by Location: The number of users in a location string
> >   (as provided by the user in their profile): location, user count
> > * Stock Tweets: Tweets that include the stock symbol tag convention
> >   of $STOCKNAME or $$. The tweet is listed once for each time a tag
> >   is used in the tweet: stock_tweet (resource name), symbol captured,
> >   tweet object (all things in a tweet)
> > * Stock Prices: Daily stock prices for the NASDAQ, NYSE, and AMEX
> >   exchanges, 1970-now: symbol, open, low, close, high, volume
>
> > Parameters for what's available:
>
> > raw object       size        number of objs
> > a_follows_b      45.8 GB     1,587,838,568
> > a_mentions_b     29.5 GB       493,682,309
> > a_retweets_b      1.6 GB        36,022,061
> > twitter_user      3.1 GB        43,261,388
> > tweets          376.0 GB     1,641,624,381
> > hashtag           7.1 GB       139,916,844
> > smiley            4.4 GB        99,272,082
> > tweet_url        29.5 GB       433,278,116
>
> > If you'd like access to any of these, or have an idea that needs
> > something /not/ here, please let me know ( f...@infochimps.org ).
> > We're only opening access to Hack Day devs for now -- but please let
> > us know your ideas so we can show twitter how much demand there is for
> > aggregated access to data.
>
> > best,
> > flip
> > @mrflip
> > 512-659-6846
>
> > ----
> > http://infochimps.org
> > Find any dataset in the world
>
> This is too short notice for me to be able to come up with a use for these 
> data. But for the future, do you by any chance have access to *intraday 
> futures and options* time series? Daily stock data are more or less useless.
