Before I start doing/testing/verifying, I'd like to check if I'm missing
something and I understand correctly the mechanics
--
-MilleBii-
URL: https://issues.apache.org/jira/browse/NUTCH-776
Project: Nutch
Issue Type: Improvement
Components: fetcher
Affects Versions: 1.1
Reporter: MilleBii
Priority: Minor
Fix For: 1.1
I propose that we create
.
Cheers,
Markus
--
-MilleBii-
parse metadata to the corresponding
entry of the crawldb.
Comments are welcome
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
--
-MilleBii-
and retrieve it ? I want to sort the
results based on that attribute value for each page.
Any clues on this?
--
-MilleBii-
___. ___ ___ ___ _ _ __
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com
--
Doğacan Güney
--
-MilleBii-
urls for exploring in a different way.
This looks like hard to do right now
2010/4/8, Doğacan Güney doga...@gmail.com:
Hi,
On Wed, Apr 7, 2010 at 21:19, MilleBii mille...@gmail.com wrote:
Just a question ?
Will the new HBase implementation allow more sophisticated crawling
strategies than
to proceed from where to start.
Help me how could I proceed
Adarsh
--
-MilleBii-
[
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
MilleBii updated NUTCH-770:
---
Attachment: log-770
Please find the logs of the patch... I did effectively try it but I could not
compile
[
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783252#action_12783252
]
MilleBii commented on NUTCH-770:
That's what I did and just retried ... so I'm a bit
[
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783252#action_12783252
]
MilleBii edited comment on NUTCH-770 at 11/29/09 8:47 PM:
--
That's
[
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786443#action_12786443
]
MilleBii commented on NUTCH-770:
Tried it succesfully on a windows platform.
It does
[
https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12786443#action_12786443
]
MilleBii edited comment on NUTCH-770 at 12/5/09 4:50 PM:
-
Tried
: MilleBii
Priority: Minor
Fix For: 1.1
I propose that we create a configurable item for the queuedepth in Fetcher.java
instead of the hard-coded value of 50.
key name : fetcher.queues.depth
Default value : remains 50 (of course)
--
This message is automatically generated
14 matches
Mail list logo