Yasara, This is the list of nouns which I suggest to remove. realdonaldtrump donaldtrump trump bernie berniesanders sanders ted tedcruz cruz hillary clinton hillaryclinton imwithher feelthebern trumpforamerica hillary2016 bernie2016 trump2016 makeamericagreatagain election2016 uselection
On Fri, Mar 11, 2016 at 4:49 AM, Srinath Perera <[email protected]> wrote: > Please cc a list. > > This is a stop words commonly removed. > > http://www.ranks.nl/stopwords > > Also you need to look at common words that does not help, and remove as > well (words like you listed). > > --Srinath > > On Thu, Mar 10, 2016 at 9:43 PM, Yudhanjaya Wijeratne <[email protected] > > wrote: > >> Guys, also please look at filtering out verbs as we discussed on >> Wednesday. Currently there are words such as 'From' 'i'M' 'Like' 'That' >> which add no value. >> If the Stanford NLP filter is running, please keep only nouns and >> adjectives and see what we get. >> >> Thanks, >> Yudha >> >> On Thu, Mar 10, 2016 at 9:40 PM, Dinali Dabarera <[email protected]> wrote: >> >>> Ok :) Then. >>> Thanks! >>> >>> On Thu, Mar 10, 2016 at 8:17 PM, Yasara Dissanayake <[email protected]> >>> wrote: >>> >>>> even in DAS too :) >>>> >>>> On Thu, Mar 10, 2016 at 8:15 PM, Yasara Dissanayake <[email protected]> >>>> wrote: >>>> >>>>> Dinaly currently NLP has stopped to find those problem in CEP and DAS >>>>> that tread problem . so please ignore recent data :) >>>>> >>>>> On Thu, Mar 10, 2016 at 8:13 PM, Dinali Dabarera <[email protected]> >>>>> wrote: >>>>> >>>>>> Sir, >>>>>> >>>>>> This is the word cloud we get only after using NLP, But according to >>>>>> my point of view there are lot of unnecessary words such as IM, GET, >>>>>> BERNIE, TEDCRUZ, IMWITHHER, MAKEAMERICAGREATAGAIN, WE, etc.. >>>>>> These words are common and no need to be in the word cloud. >>>>>> >>>>>> Therefore what i think is that we need to filter the stop words and >>>>>> then use NLP , otherwise its useless to keep a wordCloud. >>>>>> >>>>>> Thanks >>>>>> >>>>>> -- >>>>>> Dinali Rosemin >>>>>> University of Peradeniya (Computer Engineering) >>>>>> WSO2 Intern >>>>>> 077-0198933 >>>>>> >>>>> >>>>> >>>> >>> >>> >>> -- >>> Dinali Rosemin >>> University of Peradeniya (Computer Engineering) >>> WSO2 Intern >>> 077-0198933 >>> >> >> >> >> -- >> Yudhanjaya Wijeratne >> Marketing Officer, WSO2 Inc <http://www.wso2.com> >> +94775496911 | @yudhanjaya >> > > > > -- > ============================ > Blog: http://srinathsview.blogspot.com twitter:@srinath_perera > Site: http://people.apache.org/~hemapani/ > Photos: http://www.flickr.com/photos/hemapani/ > Phone: 0772360902 > -- Dinali Rosemin University of Peradeniya (Computer Engineering) WSO2 Intern 077-0198933
_______________________________________________ Dev mailing list [email protected] http://wso2.org/cgi-bin/mailman/listinfo/dev
