Generate log output for solr indexer and dedup
--
Key: NUTCH-697
URL: https://issues.apache.org/jira/browse/NUTCH-697
Project: Nutch
Issue Type: Improvement
Components: indexer
[
https://issues.apache.org/jira/browse/NUTCH-697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dmitry Lihachev updated NUTCH-697:
--
Attachment: NUTCH-697_solr_logs.patch
Generate log output for solr indexer and dedup
[
https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sami Siren updated NUTCH-694:
-
Attachment: NUTCH-694-2.patch
I rechecked this again and there was also something else wrong, I am
CrawlDb is corrupted after a few crawl cycles
-
Key: NUTCH-698
URL: https://issues.apache.org/jira/browse/NUTCH-698
Project: Nutch
Issue Type: Bug
Reporter: Doğacan Güney
[
https://issues.apache.org/jira/browse/NUTCH-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-698:
Attachment: NUTCH-698_v1.patch
Patch for the issue.
Again, Hadoop's MapWritable#putAll is broken
[
https://issues.apache.org/jira/browse/NUTCH-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-698:
Priority: Blocker (was: Major)
Oops. Wrong priority. Elevating this to blocker.
CrawlDb is
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The following page has been changed by SamiSiren:
http://wiki.apache.org/nutch/InstallingWeb2
--
+ == NOTE:
[
https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sami Siren updated NUTCH-694:
-
Patch Info: [Patch Available]
Assignee: Sami Siren
Distributed Search Server fails
[
https://issues.apache.org/jira/browse/NUTCH-573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sami Siren updated NUTCH-573:
-
Patch Info: [Patch Available]
Multiple Domains - Query Search
---
[
https://issues.apache.org/jira/browse/NUTCH-477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sami Siren updated NUTCH-477:
-
Patch Info: [Patch Available]
Extend URLFilters to support different filtering chains
[
https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675309#action_12675309
]
Andrzej Bialecki commented on NUTCH-684:
-
A few comments to this patch (and to
Apache Wiki wrote:
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The following page has been changed by SamiSiren:
http://wiki.apache.org/nutch/InstallingWeb2
Andrzej Bialecki wrote:
Apache Wiki wrote:
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki
for change notification.
The following page has been changed by SamiSiren:
http://wiki.apache.org/nutch/InstallingWeb2
[
https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675311#action_12675311
]
Dmitry Lihachev commented on NUTCH-684:
---
bq. there is a silent assumption that Solr
[
https://issues.apache.org/jira/browse/NUTCH-477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675312#action_12675312
]
Andrzej Bialecki commented on NUTCH-477:
-
(auto-review ;) )
After reflecting on
[
https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675315#action_12675315
]
Doğacan Güney commented on NUTCH-684:
-
I wasn't thinking of putting this in for 1.0, but
Add an official solr schema for solr integration
--
Key: NUTCH-699
URL: https://issues.apache.org/jira/browse/NUTCH-699
Project: Nutch
Issue Type: New Feature
Components: indexer
[
https://issues.apache.org/jira/browse/NUTCH-699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675317#action_12675317
]
Doğacan Güney commented on NUTCH-699:
-
Schema in NUTCH-442 may be a good starting point.
Neko1.9.11 goes into a loop
---
Key: NUTCH-700
URL: https://issues.apache.org/jira/browse/NUTCH-700
Project: Nutch
Issue Type: Bug
Affects Versions: 1.0.0
Reporter: julien nioche
Priority:
[
https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675323#action_12675323
]
Andrzej Bialecki commented on NUTCH-684:
-
IMHO it would be good to have this
[
https://issues.apache.org/jira/browse/NUTCH-699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675324#action_12675324
]
Dmitry Lihachev commented on NUTCH-699:
---
I think we must extend the field set for each
[
https://issues.apache.org/jira/browse/NUTCH-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675335#action_12675335
]
julien nioche commented on NUTCH-700:
-
Reported to CyberNeko
[
https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675375#action_12675375
]
Dr. Nadine Hochstotter commented on NUTCH-694:
--
Hi,
I reinstalled both sides
[
https://issues.apache.org/jira/browse/NUTCH-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675518#action_12675518
]
julien nioche commented on NUTCH-692:
-
I have been investigating this a bit more. Same