[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465700
]
Armel Nene commented on NUTCH-61:
-
I have attached a new patch, as the old one needs updating before it can be used with
Nutch
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465493
]
Sami Siren commented on NUTCH-61:
-
Haven't looked at the patch (tm)
How would one manage segments after something like
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465517
]
Andrzej Bialecki commented on NUTCH-61:
Actually, there is a way to do this, and this patch implements it.
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465540
]
Sami Siren commented on NUTCH-61:
-
OK, so in my usual use case where there are far more URLs than I can fetch this
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449128 ]
Armel Nene commented on NUTCH-61:
-
Has this patch by any chance been included in the newer release of Nutch, or is
anyone using it, as Otis asked? The reason is I am
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449170 ]
Andrzej Bialecki commented on NUTCH-61:
Unfortunately, this patch hasn't been applied yet, due to its complexity and
lack of testing.
But it will be,
Andrzej, the feature that I am after can be implemented by this patch if I
just adapt it right. I am not sure of this but the patch seems a little bit
old to be implemented in the latest release of Nutch 0.8.1.
I want to implement a feature where the fetcher will fetch files but only
add them if
Armel T. Nene wrote:
Andrzej, the feature that I am after can be implemented by this patch if I
just adapt it right. I am not sure of this but the patch seems a little bit
old to be implemented in the latest release of Nutch 0.8.1.
Right, that's why I wrote it needs to be brought
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12444514 ]
Otis Gospodnetic commented on NUTCH-61:
---
Has anyone been using the code with this patch applied? Just wondering if/how
well it works.
Adaptive re-fetch
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12368050 ]
Jerome Charron commented on NUTCH-61:
-
Not an objection, but a simple comment.
Why not make FetchSchedule a new ExtensionPoint and then DefaultFetchSchedule
and
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12368051 ]
Andrzej Bialecki commented on NUTCH-61:
I contemplated this for a while, and then decided against it.
The main reason was that currently most of the pluggable
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361346 ]
byron miller commented on NUTCH-61:
---
Most definitely! I'll be happy to give it a whirl!
Adaptive re-fetch interval. Detecting unmodified content
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361302 ]
byron miller commented on NUTCH-61:
---
Is there a patch modified for the current branch, or should I take a stab at
this?
Adaptive re-fetch interval. Detecting unmodified
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361311 ]
Andrzej Bialecki commented on NUTCH-61:
I'm working on this; the patch will be available in a couple of days. I could
then use your help with review and testing... ;-)
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361131 ]
raghavendra prabhu commented on NUTCH-61:
-
Will the same thing work for a filesystem?
For a file system, we can directly get the modified date and store it in the db
The
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361133 ]
Andrzej Bialecki commented on NUTCH-61:
This patch already supports this. Anyway, it needs to be significantly
re-worked to fit into the current development version.