Re: [VOTE] Moving to Git

2016-01-08 Thread Julien Nioche
+1 to move to Git Note : I don't think Dennis is on the PMC anymore Ju On 8 January 2016 at 08:46, Chris Mattmann wrote: > Hi Everyone, > > I proposed this earlier, and we said we’d wait until after the > 1.11 release. So it’s time to VOTE to move Nutch to Git. So > far,

Re: [VOTE] Moving to Git

2016-01-08 Thread Sebastian Nagel
+1 Sebastian On 01/08/2016 09:46 AM, Chris Mattmann wrote: > Hi Everyone, > > I proposed this earlier, and we said we’d wait until after the > 1.11 release. So it’s time to VOTE to move Nutch to Git. So > far, the following people have expressed +1s and if I don’t hear > otherwise, I will

[jira] [Resolved] (NUTCH-2169) Integrate index-html into Nutch build

2016-01-08 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2169. Resolution: Fixed Assignee: Sebastian Nagel Committed to 2.x, r1723794. > Integrate

[jira] [Commented] (NUTCH-2169) Integrate index-html into Nutch build

2016-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089997#comment-15089997 ] Hudson commented on NUTCH-2169: --- SUCCESS: Integrated in Nutch-nutchgora #1544 (See

[jira] [Resolved] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1449. -- Resolution: Fixed Committed revision 1723688. > Optionally delete documents skipped by

[jira] [Updated] (NUTCH-2178) DeduplicationJob to optionally group on host or domain

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2178: - Summary: DeduplicationJob to optionally group on host or domain (was: DeduplicationJob to

[jira] [Resolved] (NUTCH-2178) DeduplicationJob to optionally group on host or domain

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-2178. -- Resolution: Fixed Committed to trunk in revision 1723690. > DeduplicationJob to optionally

[jira] [Comment Edited] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089073#comment-15089073 ] Markus Jelsma edited comment on NUTCH-1449 at 1/8/16 11:16 AM: --- Committed to

[jira] [Commented] (NUTCH-2190) Protocol normalizer

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089081#comment-15089081 ] Markus Jelsma commented on NUTCH-2190: -- I'll also get this one in soon unless objections of course :)

[jira] [Commented] (NUTCH-2168) Parse-tika fails to retrieve parser

2016-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090337#comment-15090337 ] Lewis John McGibbney commented on NUTCH-2168: - +1 for commit [~wastl-nagel] nice catch and

[jira] [Comment Edited] (NUTCH-2168) Parse-tika fails to retrieve parser

2016-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090337#comment-15090337 ] Lewis John McGibbney edited comment on NUTCH-2168 at 1/9/16 2:03 AM: -

[jira] [Updated] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2016-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2094: Fix Version/s: (was: 2.4) 2.3.1 > Stopping and Restarting a

[jira] [Updated] (NUTCH-2166) Add reverse URL format to dump tool

2016-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2166: Fix Version/s: (was: 2.4) > Add reverse URL format to dump tool >

[jira] [Updated] (NUTCH-2165) FileDumper Util hard codes part-# folder name

2016-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2165: Fix Version/s: (was: 2.4) > FileDumper Util hard codes part-# folder name >

[jira] [Commented] (NUTCH-1838) Host and domain based regex and automaton filtering

2016-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089165#comment-15089165 ] Hudson commented on NUTCH-1838: --- SUCCESS: Integrated in Nutch-trunk #3332 (See

[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089297#comment-15089297 ] Markus Jelsma commented on NUTCH-2191: -- Hi - i've 'read' that discussion that couple of weeks ago

[jira] [Commented] (NUTCH-2178) DeduplicationJob to optionally group on host or domain

2016-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089099#comment-15089099 ] Hudson commented on NUTCH-2178: --- SUCCESS: Integrated in Nutch-trunk #3331 (See

[jira] [Commented] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters

2016-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089098#comment-15089098 ] Hudson commented on NUTCH-1449: --- SUCCESS: Integrated in Nutch-trunk #3331 (See

[jira] [Commented] (NUTCH-1838) Host and domain based regex and automaton filtering

2016-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089121#comment-15089121 ] Markus Jelsma commented on NUTCH-1838: -- Committed to trunk in revision 1723710. > Host and domain

Re: [VOTE] Moving to Git

2016-01-08 Thread Sujen Shah
+1 Regards, Sujen Shah M.S - Computer Science (Class of 2016) University of Southern California http://www.linkedin.com/in/sujenshah On Fri, Jan 8, 2016 at 2:58 PM, Julien Nioche wrote: > +1 to move to Git > > Note : I don't think Dennis is on the PMC anymore >

[jira] [Commented] (NUTCH-2191) Add protocol-htmlunit

2016-01-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089545#comment-15089545 ] Chris A. Mattmann commented on NUTCH-2191: -- Markus thanks! Check out: