Yasara, This is the list of nouns which I suggest to remove.

realdonaldtrump
donaldtrump
trump
bernie
berniesanders
sanders
ted
tedcruz
cruz
hillary
clinton
hillaryclinton
imwithher
feelthebern
trumpforamerica
hillary2016
bernie2016
trump2016
makeamericagreatagain
election2016
uselection

On Fri, Mar 11, 2016 at 4:49 AM, Srinath Perera <[email protected]> wrote:

> Please cc a list.
>
> This is a stop words commonly removed.
>
> http://www.ranks.nl/stopwords
>
> Also you need to look at common words that does not help, and remove as
> well (words like you listed).
>
> --Srinath
>
> On Thu, Mar 10, 2016 at 9:43 PM, Yudhanjaya Wijeratne <[email protected]
> > wrote:
>
>> Guys, also please look at filtering out verbs as we discussed on
>> Wednesday. Currently there are words such as 'From' 'i'M' 'Like' 'That'
>> which add no value.
>> If the Stanford NLP filter is running, please keep only nouns and
>> adjectives and see what we get.
>>
>> Thanks,
>> Yudha
>>
>> On Thu, Mar 10, 2016 at 9:40 PM, Dinali Dabarera <[email protected]> wrote:
>>
>>> Ok :) Then.
>>> Thanks!
>>>
>>> On Thu, Mar 10, 2016 at 8:17 PM, Yasara Dissanayake <[email protected]>
>>> wrote:
>>>
>>>> even in DAS too :)
>>>>
>>>> On Thu, Mar 10, 2016 at 8:15 PM, Yasara Dissanayake <[email protected]>
>>>> wrote:
>>>>
>>>>> Dinaly currently NLP has stopped to find those problem in CEP and DAS
>>>>> that tread problem . so please ignore recent data :)
>>>>>
>>>>> On Thu, Mar 10, 2016 at 8:13 PM, Dinali Dabarera <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> Sir,
>>>>>>
>>>>>> This is the word cloud we get only after using NLP, But according to
>>>>>> my point of view there are lot of unnecessary words  such as IM, GET,
>>>>>> BERNIE, TEDCRUZ, IMWITHHER, MAKEAMERICAGREATAGAIN, WE, etc..
>>>>>> These words are common and no need to be in the word cloud.
>>>>>>
>>>>>> Therefore what i think is that we need to filter the stop words and
>>>>>> then use NLP , otherwise its useless to keep a wordCloud.
>>>>>>
>>>>>> Thanks
>>>>>>
>>>>>> --
>>>>>> Dinali Rosemin
>>>>>> University of Peradeniya (Computer Engineering)
>>>>>> WSO2 Intern
>>>>>> 077-0198933
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>>
>>> --
>>> Dinali Rosemin
>>> University of Peradeniya (Computer Engineering)
>>> WSO2 Intern
>>> 077-0198933
>>>
>>
>>
>>
>> --
>> Yudhanjaya Wijeratne
>> Marketing Officer, WSO2 Inc <http://www.wso2.com>
>> +94775496911 | @yudhanjaya
>>
>
>
>
> --
> ============================
> Blog: http://srinathsview.blogspot.com twitter:@srinath_perera
> Site: http://people.apache.org/~hemapani/
> Photos: http://www.flickr.com/photos/hemapani/
> Phone: 0772360902
>



-- 
Dinali Rosemin
University of Peradeniya (Computer Engineering)
WSO2 Intern
077-0198933
_______________________________________________
Dev mailing list
[email protected]
http://wso2.org/cgi-bin/mailman/listinfo/dev

Reply via email to