[ 
https://issues.apache.org/jira/browse/NUTCH-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15219364#comment-15219364
 ] 

ASF GitHub Bot commented on NUTCH-2245:
---------------------------------------

GitHub user bhavyasanghavi opened a pull request:

    https://github.com/apache/nutch/pull/101

    Fix for NUTCH-2245 NGram Model for Cosine Similarity by bhavyasanghavi

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/bhavyasanghavi/nutch NUTCH-2245

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/101.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #101
    
----
commit 657a5f85029a8192555c0a4104cbd5f350659f62
Author: Bhavya Sanghavi <[email protected]>
Date:   2016-03-24T05:24:40Z

    develop file

commit f0c954f15514998e74edd1b66876d9318bce66a3
Author: Bhavya Sanghavi <[email protected]>
Date:   2016-03-24T06:56:09Z

    added NGram Similarity

commit b4a661d3fd777e8871604b94549aab884c0d8c34
Author: Bhavya Sanghavi <[email protected]>
Date:   2016-03-24T16:54:28Z

    added NGram Similarity

commit 34f9f54c66be6fa887a1f054d86e803049341c31
Author: Bhavya Sanghavi <[email protected]>
Date:   2016-03-27T04:25:55Z

    remove NGram.java

commit 6ff2e733a9d08b0d2752ba374fbc064e2005b712
Author: Bhavya Sanghavi <[email protected]>
Date:   2016-03-29T14:55:35Z

    Added ngram feature support to Cosine Similarity

----


> Developed the NGram Model on the existing Unigram Cosine Similarity Model
> -------------------------------------------------------------------------
>
>                 Key: NUTCH-2245
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2245
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin, scoring
>            Reporter: Bhavya Sanghavi
>            Assignee: Sujen Shah
>            Priority: Minor
>              Labels: memex
>
> Built on the existing unigram cosine similarity model by adding the Ngram 
> model, thus providing flexibility to the user to choose the window size for 
> scoring the similarity between webpages and the gold standard.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to