Hi Atul,

There is a fairly significant corpus of tweets available, although it is
fairly old - see here:
http://www.mail-archive.com/twitter-development-talk@googlegroups.com/msg05715.html.
I believe that the second part has expired, but you should be unable to use
the first part - it is several million tweets worth.

All the best,

Tom

On Wed, Oct 14, 2009 at 5:55 PM, Atul Kulkarni <atulskulka...@gmail.com>wrote:

> Thanks, John. I am not only looking at hash tags, but also other things
> that go along with tweets. I will keep in mind. I was actually curious about
> Twitter's policy on this. What is there take on releasing a certain dataset
> of say some random X number of users. Is it violation of any of their policy
> or their agreement with the users.
>
> Regards,
> Atul.
>
>
>
> On Wed, Oct 14, 2009 at 11:43 AM, JOHN OBRIEN <john.obr...@twangu.com>wrote:
>
>> Many people in academia / research have used TwapperKeeper service to
>> capture tweets of interest (that are tagged) and export for analysis.
>> Let me know if you have any questions.
>>
>> v/r,
>> John
>> http://TwapperKeeper.com
>> jobr...@ob3solutions.com
>> @jobrieniii
>>
>>
>>
>> On Oct 14, 2009, at  12:39 PM, Atul Kulkarni wrote:
>>
>> Hi All,
>>
>> I am curious if there is any Twitter data set already available for
>> research or academia? If it is already not there then can one crawl through
>> the users and build one and release it to the research community without any
>> charge? What would be Twitter's official policy on this?
>>
>> I am sure there will be a lot of interest in academia from the Linguistics
>> perspective and Machine Learning perspective. These questions are just out
>> of curiosity and feasibility study types.
>>
>> --
>> Regards,
>> Atul Kulkarni
>> www.d.umn.edu/~kulka053 <http://www.d.umn.edu/%7Ekulka053>
>>
>>
>>
>
>
> --
> Regards,
> Atul Kulkarni
> www.d.umn.edu/~kulka053 <http://www.d.umn.edu/%7Ekulka053>
>

Reply via email to