Hi Atul, There is a fairly significant corpus of tweets available, although it is fairly old - see here: http://www.mail-archive.com/[email protected]/msg05715.html. I believe that the second part has expired, but you should be unable to use the first part - it is several million tweets worth.
All the best, Tom On Wed, Oct 14, 2009 at 5:55 PM, Atul Kulkarni <[email protected]>wrote: > Thanks, John. I am not only looking at hash tags, but also other things > that go along with tweets. I will keep in mind. I was actually curious about > Twitter's policy on this. What is there take on releasing a certain dataset > of say some random X number of users. Is it violation of any of their policy > or their agreement with the users. > > Regards, > Atul. > > > > On Wed, Oct 14, 2009 at 11:43 AM, JOHN OBRIEN <[email protected]>wrote: > >> Many people in academia / research have used TwapperKeeper service to >> capture tweets of interest (that are tagged) and export for analysis. >> Let me know if you have any questions. >> >> v/r, >> John >> http://TwapperKeeper.com >> [email protected] >> @jobrieniii >> >> >> >> On Oct 14, 2009, at 12:39 PM, Atul Kulkarni wrote: >> >> Hi All, >> >> I am curious if there is any Twitter data set already available for >> research or academia? If it is already not there then can one crawl through >> the users and build one and release it to the research community without any >> charge? What would be Twitter's official policy on this? >> >> I am sure there will be a lot of interest in academia from the Linguistics >> perspective and Machine Learning perspective. These questions are just out >> of curiosity and feasibility study types. >> >> -- >> Regards, >> Atul Kulkarni >> www.d.umn.edu/~kulka053 <http://www.d.umn.edu/%7Ekulka053> >> >> >> > > > -- > Regards, > Atul Kulkarni > www.d.umn.edu/~kulka053 <http://www.d.umn.edu/%7Ekulka053> >
