Anastasija, There might be a few appropriate sentiment datasets listed in my homework on Twitter sentiment analysis:
https://github.com/utcompling/applied-nlp/wiki/Homework5 There may also be some useful data sets in the Crowdflower Open Data collection: https://www.crowdflower.com/data-for-everyone/ Hope this helps! -Jason On Wed, 22 Jun 2016 at 15:59 Anastasija Mensikova < mensikova.anastas...@gmail.com> wrote: > Hi everyone, > > Some updates on our Sentiment Analysis Parser work. > > You might have noticed, I have enhanced our website (the GH page) recently, > polished it and made it more user-friendly. My next step will be sending a > pull request to Tika. However, my main goal until the end of Google Summer > of Code is to enhance the parser in a way that will allow it to work > categorically (in other words, the sentiment determined won't be just > positive or negative, it will have a few categories). This means that my > next step is to look for a categorical open data set (which I will > hopefully do by the end of the weekend the latest) and, of course, enhance > my model and training. After that I will look into how the confidence > levels can be increased. > > Have a great day/night! > > Thank you, > Anastasija Mensikova. >