[jira] Created: (NUTCH-697) Generate log output for solr indexer and dedup

2009-02-20 Thread Dmitry Lihachev (JIRA)
Generate log output for solr indexer and dedup -- Key: NUTCH-697 URL: https://issues.apache.org/jira/browse/NUTCH-697 Project: Nutch Issue Type: Improvement Components: indexer

[jira] Updated: (NUTCH-697) Generate log output for solr indexer and dedup

2009-02-20 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-697: -- Attachment: NUTCH-697_solr_logs.patch Generate log output for solr indexer and dedup

[jira] Updated: (NUTCH-694) Distributed Search Server fails

2009-02-20 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-694: - Attachment: NUTCH-694-2.patch I rechecked this again and there was also something else wrong, I am

[jira] Created: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles

2009-02-20 Thread JIRA
CrawlDb is corrupted after a few crawl cycles - Key: NUTCH-698 URL: https://issues.apache.org/jira/browse/NUTCH-698 Project: Nutch Issue Type: Bug Reporter: Doğacan Güney

[jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles

2009-02-20 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-698: Attachment: NUTCH-698_v1.patch Patch for the issue. Again, hadoop's MapWritable#putAll is broken

[jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles

2009-02-20 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-698: Priority: Blocker (was: Major) Oops. Wrong priority. Elevating this to blocker. CrawlDb is

[Nutch Wiki] Update of InstallingWeb2 by SamiSiren

2009-02-20 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by SamiSiren: http://wiki.apache.org/nutch/InstallingWeb2 -- + == NOTE:

[jira] Updated: (NUTCH-694) Distributed Search Server fails

2009-02-20 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-694: - Patch Info: [Patch Available] Assignee: Sami Siren Distributed Search Server fails

[jira] Updated: (NUTCH-573) Multiple Domains - Query Search

2009-02-20 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-573: - Patch Info: [Patch Available] Multiple Domains - Query Search ---

[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains

2009-02-20 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-477: - Patch Info: [Patch Available] Extend URLFilters to support different filtering chains

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-02-20 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675309#action_12675309 ] Andrzej Bialecki commented on NUTCH-684: - A few comments to this patch (and to

Re: [Nutch Wiki] Update of InstallingWeb2 by SamiSiren

2009-02-20 Thread Andrzej Bialecki
Apache Wiki wrote: Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by SamiSiren: http://wiki.apache.org/nutch/InstallingWeb2

Re: [Nutch Wiki] Update of InstallingWeb2 by SamiSiren

2009-02-20 Thread Sami Siren
Andrzej Bialecki wrote: Apache Wiki wrote: Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by SamiSiren: http://wiki.apache.org/nutch/InstallingWeb2

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-02-20 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675311#action_12675311 ] Dmitry Lihachev commented on NUTCH-684: --- bq. there is a silent assumption that Solr

[jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains

2009-02-20 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675312#action_12675312 ] Andrzej Bialecki commented on NUTCH-477: - (auto-review ;) ) After reflecting on

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-02-20 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675315#action_12675315 ] Doğacan Güney commented on NUTCH-684: - I wasn't thinking of putting this in for 1.0, but

[jira] Created: (NUTCH-699) Add an official solr schema for solr integration

2009-02-20 Thread JIRA
Add an official solr schema for solr integration -- Key: NUTCH-699 URL: https://issues.apache.org/jira/browse/NUTCH-699 Project: Nutch Issue Type: New Feature Components: indexer

[jira] Commented: (NUTCH-699) Add an official solr schema for solr integration

2009-02-20 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675317#action_12675317 ] Doğacan Güney commented on NUTCH-699: - Schema in NUTCH-442 may be a good starting point.

[jira] Created: (NUTCH-700) Neko1.9.11 goes into a loop

2009-02-20 Thread julien nioche (JIRA)
Neko1.9.11 goes into a loop --- Key: NUTCH-700 URL: https://issues.apache.org/jira/browse/NUTCH-700 Project: Nutch Issue Type: Bug Affects Versions: 1.0.0 Reporter: julien nioche Priority:

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-02-20 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675323#action_12675323 ] Andrzej Bialecki commented on NUTCH-684: - IMHO it would be good to have this

[jira] Commented: (NUTCH-699) Add an official solr schema for solr integration

2009-02-20 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675324#action_12675324 ] Dmitry Lihachev commented on NUTCH-699: --- I think we must extends field set for each

[jira] Commented: (NUTCH-700) Neko1.9.11 goes into a loop

2009-02-20 Thread julien nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675335#action_12675335 ] julien nioche commented on NUTCH-700: - Reported to CyberNeko

[jira] Commented: (NUTCH-694) Distributed Search Server fails

2009-02-20 Thread Dr. Nadine Hochstotter (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675375#action_12675375 ] Dr. Nadine Hochstotter commented on NUTCH-694: -- Hi, I reinstalled both sides

[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19

2009-02-20 Thread julien nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12675518#action_12675518 ] julien nioche commented on NUTCH-692: - I have been investigating this a bit more. Same