[jira] [Assigned] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1585: --- Assignee: Lewis John McGibbney > Ensure duplicate tags do not exist in mi

[jira] [Updated] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1585: Attachment: NUTCH-1585-trunk.patch NUTCH-1585-2.x.patch patches for

[jira] [Created] (NUTCH-1585) Ensure duplicate tags do not exist in microformat-reltag tag set.

2013-06-18 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1585: --- Summary: Ensure duplicate tags do not exist in microformat-reltag tag set. Key: NUTCH-1585 URL: https://issues.apache.org/jira/browse/NUTCH-1585 Project

[jira] [Created] (NUTCH-1584) Port NUTCH-1405 Allow to overwrite CrawlDatum's with injected entries to 2.x

2013-06-18 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1584: --- Summary: Port NUTCH-1405 Allow to overwrite CrawlDatum's with injected entries to 2.x Key: NUTCH-1584 URL: https://issues.apache.org/jira/browse/NUTCH-1584

[jira] [Updated] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch

2013-06-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1527: Attachment: NUTCH-1527v2.patch New patch removing your Boilerpipe stuff Markus. I a

[jira] [Commented] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch

2013-06-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13687176#comment-13687176 ] Lewis John McGibbney commented on NUTCH-1527: - Hi Markus, the attached patch a

Jenkins build is back to normal : Nutch-trunk #2245

2013-06-18 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-1475) Index-More Plugin -- A better fall back value for date field

2013-06-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13687079#comment-13687079 ] Hudson commented on NUTCH-1475: --- Integrated in Nutch-trunk #2245 (See [https://builds.apach

[jira] [Resolved] (NUTCH-1475) Index-More Plugin -- A better fall back value for date field

2013-06-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1475. - Resolution: Fixed Committed @revision 1494234 in trunk. Thank you [~wastl-nagel]

Re: Nutch Site

2013-06-18 Thread Mattmann, Chris A (398J)
Woot you da man Lewis ++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/

[jira] [Commented] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch

2013-06-18 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13686830#comment-13686830 ] lufeng commented on NUTCH-1527: --- Thanks Markus, I try the patch and can index the document s

[jira] [Updated] (NUTCH-1475) Index-More Plugin -- A better fall back value for date field

2013-06-18 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1475: --- Attachment: NUTCH-1475-trunk-v1.patch Why not rely first on CrawlDatum's modifiedTime? See pa

[jira] [Updated] (NUTCH-1583) Headings does not support multiValued headings

2013-06-18 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1583: - Attachment: NUTCH-1583.patch Patch for trunk. If headings.multivalued=true multiple values will b

[jira] [Created] (NUTCH-1583) Headings does not support multiValued headings

2013-06-18 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-1583: Summary: Headings does not support multiValued headings Key: NUTCH-1583 URL: https://issues.apache.org/jira/browse/NUTCH-1583 Project: Nutch Issue Type: Impr

[jira] [Updated] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch

2013-06-18 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1527: - Attachment: NUTCH-1527.patch Ok, here's a new patch. If you set elastic.host (elastic.port is def

Re: Nutch Site

2013-06-18 Thread Julien Nioche
Hi Lewis, Brilliant! Thanks a lot Julien On 18 June 2013 05:32, Lewis John Mcgibbney wrote: > Hi All, > @Julien, > A while ago you mentioned about changing the Nutch site to be more direct > towards Downloads. I agreed with this but as I didn't deal with it then and > there, it got put to the