On 6/27/07, Kai_testing Middleton [EMAIL PROTECTED] wrote:
Wow, setting db.max.outlinks.per.page immediately fixed my problem. It looks
like I totally misdiagnosed things.
May I pose two questions:
1) how did you view all the outlinks?
bin/nutch plugin parse-html
[
https://issues.apache.org/jira/browse/NUTCH-474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508747
]
Hudson commented on NUTCH-474:
--
Integrated in Nutch-Nightly #131 (See
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508748
]
Hudson commented on NUTCH-498:
--
Integrated in Nutch-Nightly #131 (See
[
https://issues.apache.org/jira/browse/NUTCH-499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508749
]
Hudson commented on NUTCH-499:
--
Integrated in Nutch-Nightly #131 (See
On 6/28/07, Hudson (JIRA) [EMAIL PROTECTED] wrote:
[
https://issues.apache.org/jira/browse/NUTCH-474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508747
]
Hudson commented on NUTCH-474:
--
Integrated in Nutch-Nightly #131
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508812
]
Doğacan Güney commented on NUTCH-392:
-
OK, I have done a bit of testing on compression but I'm stuck. Here it is:
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508816
]
Andrzej Bialecki commented on NUTCH-392:
-
Re: Content versioning - we can use negative int values as version
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508818
]
Doğacan Güney commented on NUTCH-392:
-
Re: Content versioning - we can use negative int values as version
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508820
]
Sami Siren commented on NUTCH-392:
--
But why is parse_text_block's size so close to parse_text
data of parse_text
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508823
]
Doğacan Güney commented on NUTCH-392:
-
data of parse_text is already compressed so recompressing it does not
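(The comment above is cut off in this listing. Assuming it goes on to say that a second compression pass gains little, the effect can be illustrated outside Nutch with a minimal sketch. It uses only java.util.zip.Deflater from the JDK, not the Nutch or Hadoop codecs discussed in NUTCH-392; the class name, helper names and sample vocabulary are hypothetical.)

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Random;
import java.util.zip.Deflater;

public class RecompressDemo {
    // Hypothetical vocabulary used only to generate compressible sample text.
    static final String[] WORDS = {
        "nutch", "segment", "parse", "text", "content", "crawl", "fetch", "index",
        "outlink", "plugin", "hadoop", "record", "block", "codec", "page", "data"
    };

    // Compress a byte array with DEFLATE and return the compressed bytes.
    static byte[] deflate(byte[] input) {
        Deflater deflater = new Deflater(Deflater.DEFAULT_COMPRESSION);
        deflater.setInput(input);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096];
        while (!deflater.finished()) {
            int n = deflater.deflate(buffer);
            out.write(buffer, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    // Build sample text from randomly chosen words (fixed seed, so runs are repeatable).
    static byte[] sampleText(int wordCount) {
        Random random = new Random(42);
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < wordCount; i++) {
            sb.append(WORDS[random.nextInt(WORDS.length)]).append(' ');
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] raw = sampleText(20000);
        byte[] once = deflate(raw);   // first pass removes most of the redundancy
        byte[] twice = deflate(once); // second pass has little redundancy left to exploit
        System.out.println("raw bytes:              " + raw.length);
        System.out.println("compressed once:        " + once.length);
        System.out.println("compressed twice:       " + twice.length);
    }
}
```

The first pass should shrink the text substantially, while the second pass leaves the size roughly unchanged. That would be consistent with parse_text_block ending up close in size to parse_text when the inner data is already compressed.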
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508861
]
Doğacan Güney commented on NUTCH-392:
-
After changing ParseText to not do any internal compression, segment
Where can I find the library that provides
com.etranslate.tm.processing.rtf.ParseException, which the Java source code imports?
[
https://issues.apache.org/jira/browse/NUTCH-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508900
]
Andrzej Bialecki commented on NUTCH-392:
-
Excellent work, Doğacan - thank you. The numbers for RECORD
I found the jar file.
I would like to join the Nutch developer team.
Where should I get started?
Adam Shuy
President
ePacific Web Design Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-Original Message-
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent:
I tried to create an Eclipse launcher, but I got the following error:
Exception in thread "main" java.io.IOException: Input directory
C:/JavaSearchEngine/nutch-0.8.1/urls in local is invalid.
How can I solve the above error?
Adam Shuy
President
ePacific Web Design Hosting
Professional Web/Software
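(One plausible cause of the "Input directory ... is invalid" error above is that the urls directory does not exist, or is not a directory, at the path the launcher resolves. This is an assumption; the thread as listed here does not confirm it. A minimal sketch for checking the path before launching, using only java.nio.file from the JDK; the class and method names are hypothetical and not part of Nutch.)

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

public class UrlDirCheck {
    // Report whether the given path looks usable as a seed-URL input directory.
    static String check(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            return "missing";
        }
        if (!Files.isDirectory(dir)) {
            return "not a directory";
        }
        // An existing but empty directory would give the crawler nothing to inject.
        try (Stream<Path> entries = Files.list(dir)) {
            return entries.findAny().isPresent() ? "ok" : "empty";
        }
    }

    public static void main(String[] args) throws IOException {
        // Defaults to a relative "urls" directory, resolved against the working directory.
        Path dir = Paths.get(args.length > 0 ? args[0] : "urls");
        System.out.println(dir.toAbsolutePath() + ": " + check(dir));
    }
}
```

Running it with the same working directory as the Eclipse launcher shows which absolute path a relative "urls" argument resolves to, which is often where such a mismatch comes from.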
15 matches