On 1/20/10 2:35 AM, Jason Rutherglen wrote:
We've got Newsgroup classification. I'm kinda of interested in
creating a Twitter classification system, or at least playing
around with it. Also I think as a relevant growing large data
set, it seems Twitter fit well with Hadoop based machine
learning algorithms... Just throwing out into the wild!
Hi Jason.
I think the biggest issues here are twofold.
1. access to the data, although I'm sure the ASF could work something
out here
2. training data. wouldn't you need a set of 'tweets' classified in some
manner? or were you thinking of using a different data source to base it on?