[jira] [Work started] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2038 started by Chris A. Mattmann. > Naive Bayes classifier based html Parse filter (for filtering outlinks)

Build failed in Jenkins: Nutch-trunk #3182

2015-06-29 Thread Apache Jenkins Server
See -- [...truncated 3916 lines...] [echo] Compiling plugin: urlnormalizer-host [javac] Compiling 2 source files to

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606592#comment-14606592 ] Asitang Mishra commented on NUTCH-2038: --- Hi [~wastl-nagel], I am facing the followi

[jira] [Resolved] (NUTCH-2052) Enhance index-static to allow configurable delimiters

2015-06-29 Thread Peter Ciuffetti (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Ciuffetti resolved NUTCH-2052. Resolution: Fixed Fix Version/s: 1.11 Pull request in https://github.com/apache/nutch

[GitHub] nutch pull request: Nutch 2052 - Enhancement to index-static to al...

2015-06-29 Thread PeterCiuffetti
GitHub user PeterCiuffetti opened a pull request: https://github.com/apache/nutch/pull/43 Nutch 2052 - Enhancement to index-static to allow user-defined delimiters for fields You can merge this pull request into a Git repository by running: $ git pull https://github.com/Peter

Unsubscribe

2015-06-29 Thread Disha Ajmani

[Nutch Wiki] Update of "SimilarityScoringFilter" by SujenShah

2015-06-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "SimilarityScoringFilter" page has been changed by SujenShah: https://wiki.apache.org/nutch/SimilarityScoringFilter New page: = Similarity based Scoring = <> == Summary == The link

[Nutch Wiki] Update of "NutchScoring" by SujenShah

2015-06-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "NutchScoring" page has been changed by SujenShah: https://wiki.apache.org/nutch/NutchScoring?action=diff&rev1=13&rev2=14 * [[http://nutch.apache.org/apidocs/apidocs-1.9/index.ht

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606359#comment-14606359 ] ASF GitHub Bot commented on NUTCH-2038: --- GitHub user asitang opened a pull request:

[GitHub] nutch pull request: NUTCH-2038

2015-06-29 Thread asitang
GitHub user asitang opened a pull request: https://github.com/apache/nutch/pull/42 NUTCH-2038 minor changes and suggestions by Sebastian. You can merge this pull request into a Git repository by running: $ git pull https://github.com/asitang/nutch NUTCH-2038 Alternatively you

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606348#comment-14606348 ] ASF GitHub Bot commented on NUTCH-2038: --- Github user asitang closed the pull request

[GitHub] nutch pull request: NUTCH-2038

2015-06-29 Thread asitang
Github user asitang closed the pull request at: https://github.com/apache/nutch/pull/41 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606234#comment-14606234 ] ASF GitHub Bot commented on NUTCH-2038: --- GitHub user asitang opened a pull request:

[GitHub] nutch pull request: NUTCH-2038

2015-06-29 Thread asitang
GitHub user asitang opened a pull request: https://github.com/apache/nutch/pull/41 NUTCH-2038 --added specific IOException messages --added files: conf/naivebayes-train.txt.template conf/naivebayes-wordlist.txt.template You can merge this pull request into a Git reposit

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606228#comment-14606228 ] ASF GitHub Bot commented on NUTCH-2038: --- Github user asitang closed the pull request

[GitHub] nutch pull request: NUTCH-2038

2015-06-29 Thread asitang
Github user asitang closed the pull request at: https://github.com/apache/nutch/pull/40 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

[jira] [Created] (NUTCH-2052) Enhance index-static to allow configurable delimiters

2015-06-29 Thread Peter Ciuffetti (JIRA)
Peter Ciuffetti created NUTCH-2052: -- Summary: Enhance index-static to allow configurable delimiters Key: NUTCH-2052 URL: https://issues.apache.org/jira/browse/NUTCH-2052 Project: Nutch Issue

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605729#comment-14605729 ] Sebastian Nagel commented on NUTCH-2038: Yep, this fixes problem #2. > Naive Baye

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605716#comment-14605716 ] Sebastian Nagel commented on NUTCH-2038: (#1) The unit tests fail if build and tes

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605681#comment-14605681 ] ASF GitHub Bot commented on NUTCH-2038: --- GitHub user asitang opened a pull request:

[GitHub] nutch pull request: NUTCH-2038

2015-06-29 Thread asitang
GitHub user asitang opened a pull request: https://github.com/apache/nutch/pull/40 NUTCH-2038 added all the jars in plugin.xml You can merge this pull request into a Git repository by running: $ git pull https://github.com/asitang/nutch NUTCH-2038 Alternatively you can review

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605661#comment-14605661 ] Asitang Mishra commented on NUTCH-2038: --- Yup dint fail for me as well.. gonna list a

[jira] [Commented] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605643#comment-14605643 ] Chris A. Mattmann commented on NUTCH-2038: -- Ugh, On #2, I guess I missed setting

[jira] [Reopened] (NUTCH-2038) Naive Bayes classifier based html Parse filter (for filtering outlinks)

2015-06-29 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reopened NUTCH-2038: # unit test TestParserFactory fails:: {noformat} Testcase: testGetParsers took 0.892 sec