[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865632#comment-13865632 ] Markus Jelsma commented on NUTCH-1113: -- I'll run some more tests tomorrow, at least i

[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865631#comment-13865631 ] Markus Jelsma commented on NUTCH-1113: -- No. We don't really seem to be losing records

[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-08 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865624#comment-13865624 ] Sebastian Nagel commented on NUTCH-1113: Isn't this fixed with NUTCH-1520? > Merg

[jira] [Updated] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1113: - Attachment: NUTCH-1113-trunk.patch Patch for trunk with Edward's fix. That fix at least solves a

Re: Nightly builds

2014-01-08 Thread Julien Nioche
Great stuff, thanks Lewis On 8 January 2014 12:00, Lewis John Mcgibbney wrote: > Hi Folks, > > On Wed, Jan 8, 2014 at 4:06 AM, wrote: > >> I'm working on getting the Jenkins job configuration stable again. >> Something seems to have been reset or is not correct. >> I'll update here once we are

[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865400#comment-13865400 ] Markus Jelsma commented on NUTCH-1113: -- NUTCH-1616 is actually a duplicate of this is

[jira] [Resolved] (NUTCH-1616) SegmentMerger missing proper crawl_fetch datum

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1616. -- Resolution: Duplicate Ah, I finally realize this issue is an exact duplicate of NUTCH-1113, my p

[jira] [Commented] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865353#comment-13865353 ] Hudson commented on NUTCH-1695: --- SUCCESS: Integrated in Nutch-nutchgora #878 (See [https://

Jenkins build is back to normal : Nutch-nutchgora #878

2014-01-08 Thread Apache Jenkins Server
See

Re: Nightly builds

2014-01-08 Thread Lewis John Mcgibbney
Hi Folks, On Wed, Jan 8, 2014 at 4:06 AM, wrote: > I'm working on getting the Jenkins job configuration stable again. > Something seems to have been reset or is not correct. > I'll update here once we are back to stable builds. > > Seems that there was an upgrade to the Jenkins servers we run th

Jenkins build is back to normal : Nutch-trunk #2483

2014-01-08 Thread Apache Jenkins Server
See

[jira] [Updated] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1695: Attachment: NUTCH-1695-2.x.patch patch for 2.x HEAD > NutchDocument.toString() > -

[jira] [Commented] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865341#comment-13865341 ] Lewis John McGibbney commented on NUTCH-1695: - Committed @revision 1556503 in

Build failed in Jenkins: Nutch-trunk #2482

2014-01-08 Thread Apache Jenkins Server
See -- [...truncated 6786 lines...] deps-jar: clean-lib: resolve-default: [ivy:resolve] :: loading settings :: file = compile:

Build failed in Jenkins: Nutch-nutchgora #877

2014-01-08 Thread Apache Jenkins Server
See Changes: [lewismc] GORA-1696 Enable use of (Gora) SNAPSHOT dependencies -- [...truncated 5141 lines...] deploy: copy-generated-lib: deploy: copy-generated-lib: init: init-plugin: deps-ja

[jira] [Resolved] (NUTCH-1696) Enable use of (Gora) SNAPSHOT dependencies

2014-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1696. - Resolution: Fixed Confirmed with various folks offline. Committed @revision 15564

[jira] [Commented] (NUTCH-1693) TextMD5Signatue compute on textual content

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865306#comment-13865306 ] Markus Jelsma commented on NUTCH-1693: -- By the way, there are several places in Nutch

[jira] [Updated] (NUTCH-1693) TextMD5Signatue compute on textual content

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1693: - Attachment: NUTCH-1693-trunk.patch Updates patch for trunk. It now relies on Strings that are UTF

Build failed in Jenkins: Nutch-trunk #2481

2014-01-08 Thread Apache Jenkins Server
See Changes: [markus] NUTCH-1695 Add NutchDocument.toString() to ease debugging -- [...truncated 6790 lines...] clean-lib: resolve-default: [ivy:resolve] :: loading settings :: file = /home/hudson

[jira] [Commented] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865258#comment-13865258 ] Hudson commented on NUTCH-1695: --- FAILURE: Integrated in Nutch-trunk #2481 (See [https://bui

[jira] [Resolved] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1695. -- Resolution: Fixed Committed to trunk in revision 1556474. > NutchDocument.toString() > --

[jira] [Commented] (NUTCH-1695) NutchDocument.toString()

2014-01-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865249#comment-13865249 ] Markus Jelsma commented on NUTCH-1695: -- Thanks! > NutchDocument.toString() > ---