[ 
https://issues.apache.org/jira/browse/NUTCH-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15238102#comment-15238102
 ] 

Bhavya Sanghavi commented on NUTCH-2249:
----------------------------------------

[~sujenshah]

> WordNet Integration for Cosine Similarity
> -----------------------------------------
>
>                 Key: NUTCH-2249
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2249
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin, scoring
>            Reporter: Bhavya Sanghavi
>            Priority: Minor
>
> Integrated WordNet database to enhance the cosine similarity plugin. 
> This helps in reducing the size of the vectors for calculating the cosine 
> similarity by mapping the synonymous words to the same entry in the vector. 
> Consequently, it would increase the accuracy of the scores given to the 
> webpages to be crawled. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to