Filtering ParseSegment

2009-12-10 Thread MilleBii
Before I start doing/testing/verifying, I'd like to check whether I'm missing something and whether I understand the mechanics correctly -- -MilleBii-

Re: [jira] Commented: (NUTCH-776) Configurable queue depth

2010-01-07 Thread MilleBii
URL: https://issues.apache.org/jira/browse/NUTCH-776 Project: Nutch Issue Type: Improvement Components: fetcher Affects Versions: 1.1 Reporter: MilleBii Priority: Minor Fix For: 1.1 I propose that we create

Re: Injecting urls and define Inlink

2010-01-19 Thread MilleBii
. Cheers, Markus -- -MilleBii-

Re: [jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb

2010-01-20 Thread MilleBii
parse metadata to the corresponding entry of the crawldb. Comments are welcome -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. -- -MilleBii-

Re: adding an Index attribute

2010-03-07 Thread MilleBii
and retrieve it? I want to sort the results based on that attribute value for each page. Any clues on this? -- -MilleBii-

Re: Nutch 2.0 roadmap

2010-04-07 Thread MilleBii
Information Retrieval, Semantic Web; Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com -- Doğacan Güney -- -MilleBii-

Re: Nutch 2.0 roadmap

2010-04-08 Thread MilleBii
urls for exploring in a different way. This looks hard to do right now. 2010/4/8, Doğacan Güney doga...@gmail.com: Hi, On Wed, Apr 7, 2010 at 21:19, MilleBii mille...@gmail.com wrote: Just a question: will the new HBase implementation allow more sophisticated crawling strategies than

Re: Developing Nutch for semantic search

2010-04-19 Thread MilleBii
to proceed from where to start. Help me how could I proceed Adarsh -- -MilleBii-

[jira] Updated: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] MilleBii updated NUTCH-770: --- Attachment: log-770 Please find the logs of the patch... I did effectively try it but I could not compile

[jira] Commented: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783252#action_12783252 ] MilleBii commented on NUTCH-770: That's what I did and just retried ... so I'm a bit

[jira] Issue Comment Edited: (NUTCH-770) Timebomb for Fetcher

2009-11-29 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783252#action_12783252 ] MilleBii edited comment on NUTCH-770 at 11/29/09 8:47 PM: -- That's

[jira] Commented: (NUTCH-770) Timebomb for Fetcher

2009-12-05 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786443#action_12786443 ] MilleBii commented on NUTCH-770: Tried it successfully on a Windows platform. It does

[jira] Issue Comment Edited: (NUTCH-770) Timebomb for Fetcher

2009-12-05 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786443#action_12786443 ] MilleBii edited comment on NUTCH-770 at 12/5/09 4:50 PM: - Tried

[jira] Created: (NUTCH-776) Configurable queue depth

2009-12-17 Thread MilleBii (JIRA)
: MilleBii Priority: Minor Fix For: 1.1 I propose that we create a configurable item for the queuedepth in Fetcher.java instead of the hard-coded value of 50. key name : fetcher.queues.depth Default value : remains 50 (of course) -- This message is automatically generated
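
For readers skimming the archive, here is a rough sketch of what the NUTCH-776 proposal amounts to. It is not the actual patch and not Nutch's API: java.util.Properties stands in for the real configuration object, and the class and method names (QueueDepthSketch, queueDepth) are hypothetical. Only the key name fetcher.queues.depth and the default of 50 come from the issue text; falling back to the default on a missing, malformed, or non-positive value is an assumption of this sketch.

```java
import java.util.Properties;

public class QueueDepthSketch {
    // Default mirrors the hard-coded value of 50 mentioned in the issue.
    static final int DEFAULT_QUEUE_DEPTH = 50;

    // Key name as proposed in the issue.
    static final String QUEUE_DEPTH_KEY = "fetcher.queues.depth";

    // Hypothetical helper: reads the queue depth from a Properties object,
    // which here stands in for the crawler's real configuration object.
    // Missing, malformed, or non-positive values fall back to the default
    // (an assumption of this sketch, not something the issue specifies).
    static int queueDepth(Properties conf) {
        String raw = conf.getProperty(QUEUE_DEPTH_KEY);
        if (raw == null || raw.trim().isEmpty()) {
            return DEFAULT_QUEUE_DEPTH;
        }
        try {
            int depth = Integer.parseInt(raw.trim());
            return depth > 0 ? depth : DEFAULT_QUEUE_DEPTH;
        } catch (NumberFormatException e) {
            return DEFAULT_QUEUE_DEPTH;
        }
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Key not set: the default applies.
        System.out.println(queueDepth(conf));
        // Key set: the configured value overrides the default.
        conf.setProperty(QUEUE_DEPTH_KEY, "100");
        System.out.println(queueDepth(conf));
    }
}
```

Keeping the default at 50 means existing setups behave exactly as before unless the key is set explicitly.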