[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-09-09 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738216#comment-14738216 ] Alexander Kingson commented on NUTCH-1679: -- Attaching patch, which is tested for crawling with

[jira] [Updated] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-09-09 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-1679: - Attachment: NUTCH-1679_3.patch > UpdateDb using batchId, link may override crawled page.

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-08-31 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14723886#comment-14723886 ] Alexander Kingson commented on NUTCH-1679: -- I may have a patch in 1-2 weeks. > UpdateDb using

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-08-28 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14720880#comment-14720880 ] Alexander Kingson commented on NUTCH-1679: -- I took a look to 1.x updateReducer. I

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-08-25 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711774#comment-14711774 ] Alexander Kingson commented on NUTCH-1679: -- Hi, It seems to me that in this case

[jira] [Commented] (NUTCH-961) Expose Tika's boilerpipe support

2015-04-01 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14391558#comment-14391558 ] Alexander Kingson commented on NUTCH-961: - Hello, Since I was not getting

[jira] [Updated] (NUTCH-961) Expose Tika's boilerpipe support

2015-04-01 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-961: Attachment: nutch-2.x-boilerpipe.patch Expose Tika's boilerpipe support

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2014-07-21 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069567#comment-14069567 ] Alexander Kingson commented on NUTCH-1679: -- Hi, I was suggesting to close the

[jira] [Comment Edited] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2014-07-21 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069567#comment-14069567 ] Alexander Kingson edited comment on NUTCH-1679 at 7/22/14 12:15 AM:

[jira] [Comment Edited] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2014-07-21 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069567#comment-14069567 ] Alexander Kingson edited comment on NUTCH-1679 at 7/22/14 12:46 AM:

[jira] [Updated] (NUTCH-1714) Nutch 2.x upgrade to use GORA_94 branch

2014-04-17 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-1714: - Attachment: (was: NUTCH-1714_NUTCH-1714_v2_v3.patch) Nutch 2.x upgrade to use

[jira] [Updated] (NUTCH-1714) Nutch 2.x upgrade to use GORA_94 branch

2014-04-17 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-1714: - Attachment: NUTCH-1714_NUTCH-1714_v2_v3.patch Replacing patch that counts changes to

[jira] [Updated] (NUTCH-1714) Nutch 2.x upgrade to use GORA_94 branch

2014-04-15 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-1714: - Attachment: NUTCH-1714_NUTCH-1714_v2_v3.patch Hello, Attaching patch that is a result

[jira] [Commented] (NUTCH-945) Indexing to multiple SOLR Servers

2013-01-28 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13564953#comment-13564953 ] Alexander Kingson commented on NUTCH-945: - I see that the issue is unresolved.Is

[jira] [Commented] (NUTCH-1457) Nutch2 Refactor the update process so that fetched items are only processed once

2012-11-08 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13493413#comment-13493413 ] Alexander Kingson commented on NUTCH-1457: -- Hi, Could you please give me more

[jira] [Commented] (NUTCH-1457) Nutch2 Refactor the update process so that fetched items are only processed once

2012-11-05 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13491124#comment-13491124 ] Alexander Kingson commented on NUTCH-1457: -- Can we use batchId in update command

[jira] [Updated] (NUTCH-1411) nutchgora fetcher.store.content does not work

2012-07-06 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Kingson updated NUTCH-1411: - Attachment: storeContent.patch This patch is tested mysql storage, only.

[jira] [Comment Edited] (NUTCH-1411) nutchgora fetcher.store.content does not work

2012-07-06 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13408311#comment-13408311 ] Alexander Kingson edited comment on NUTCH-1411 at 7/6/12 8:54 PM: