[jira] [Commented] (NUTCH-2220) Rename db.* options used only by the linkdb to linkdb.*

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157831#comment-15157831 ] Sebastian Nagel commented on NUTCH-2220: 0 / +1 Since this breaks existing crawl configurations: a

[jira] [Commented] (NUTCH-2221) Introduce db.ignore.internal.links to FetcherThread

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157816#comment-15157816 ] Sebastian Nagel commented on NUTCH-2221: +1 Just to consider: the additional argument to

[jira] [Commented] (NUTCH-2216) db.ignore.*.links to optionally follow internal redirects

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1515#comment-1515 ] Sebastian Nagel commented on NUTCH-2216: * this was the case before, but shouldn't

[jira] [Updated] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2228: --- Attachment: NUTCH-2228.patch > index-replace unit test fails > -

[jira] [Updated] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2228: --- Patch Info: Patch Available > index-replace unit test fails > - >

[jira] [Comment Edited] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157655#comment-15157655 ] Sebastian Nagel edited comment on NUTCH-2228 at 2/22/16 8:38 PM: - The name

[jira] [Commented] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157655#comment-15157655 ] Sebastian Nagel commented on NUTCH-2228: The name of the failing test "testInvalidPatterns"

[jira] [Commented] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157632#comment-15157632 ] Sebastian Nagel commented on NUTCH-2228: That's only a problem if Nutch is built with Java 8.

[jira] [Created] (NUTCH-2228) index-replace unit test fails

2016-02-22 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-2228: Summary: index-replace unit test fails Key: NUTCH-2228 URL: https://issues.apache.org/jira/browse/NUTCH-2228 Project: Nutch Issue Type: Bug

[jira] [Work stopped] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2227 stopped by Markus Jelsma. > RegexParseFilter > > > Key: NUTCH-2227 >

[jira] [Updated] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2227: - Attachment: NUTCH-2227.patch Updated patch, added negative test. Which works. Will commit

[jira] [Updated] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2227: - Attachment: NUTCH-2227.patch Updated patch, build.xml was missing > RegexParseFilter >

[jira] [Updated] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2227: - Attachment: NUTCH-2227.patch Patch for trunk! Tests pass. > RegexParseFilter >

[jira] [Work started] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2227 started by Markus Jelsma. > RegexParseFilter > > > Key: NUTCH-2227 >

[jira] [Updated] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2227: - Description: A parse filter that takes a regex and a field name. If regex matches via

[jira] [Created] (NUTCH-2227) RegexParseFilter

2016-02-22 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-2227: Summary: RegexParseFilter Key: NUTCH-2227 URL: https://issues.apache.org/jira/browse/NUTCH-2227 Project: Nutch Issue Type: New Feature Components:

[jira] [Commented] (NUTCH-2219) Criteria order to be configurable in DeduplicationJob

2016-02-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157091#comment-15157091 ] Hudson commented on NUTCH-2219: --- SUCCESS: Integrated in Nutch-trunk #3350 (See

[jira] [Updated] (NUTCH-2219) Criteria order to be configurable in DeduplicationJob

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2219: - Fix Version/s: 1.12 > Criteria order to be configurable in DeduplicationJob >

[jira] [Updated] (NUTCH-2219) Criteria order to be configurable in DeduplicationJob

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2219: - Affects Version/s: 1.11 > Criteria order to be configurable in DeduplicationJob >

[jira] [Resolved] (NUTCH-2219) Criteria order to be configurable in DeduplicationJob

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-2219. -- Resolution: Fixed Committed to trunk in revision 1731651. Thanks Ron van der Vegt > Criteria

[jira] [Commented] (NUTCH-2226) SOLR mismatch in deploy mode

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157027#comment-15157027 ] Markus Jelsma commented on NUTCH-2226: -- Hello - how is this related? Are you using trunk? We run

[jira] [Commented] (NUTCH-2220) Rename db.* options used only by the linkdb to linkdb.*

2016-02-22 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15156711#comment-15156711 ] Markus Jelsma commented on NUTCH-2220: -- Any comments to this change, e.g. separate db and linkdb

RE: [RESULT] [VOTE] Moving to Git

2016-02-22 Thread Markus Jelsma
Can someone please put up a small howto somewhere? I need to know how to: * check out trunk * check out a specific tag * do a svn up * create a patch, e.g. svn diff * perform a commit Thanks, Markus -Original message- > From:Mattmann, Chris A (3980) >