[
https://issues.apache.org/jira/browse/NUTCH-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2249:
-----------------------------------
Fix Version/s: (was: 1.15)
> WordNet Integration for Cosine Similarity
> -----------------------------------------
>
> Key: NUTCH-2249
> URL: https://issues.apache.org/jira/browse/NUTCH-2249
> Project: Nutch
> Issue Type: New Feature
> Components: plugin, scoring
> Reporter: Bhavya Sanghavi
> Assignee: Sujen Shah
> Priority: Minor
> Labels: memex
>
> Integrated WordNet database to enhance the cosine similarity plugin.
> This helps in reducing the size of the vectors for calculating the cosine
> similarity by mapping the synonymous words to the same entry in the vector.
> Consequently, it would increase the accuracy of the scores given to the
> webpages to be crawled.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)