[
http://issues.apache.org/jira/browse/NUTCH-335?page=comments#action_12424855 ]
Siddharudh nadgeri commented on NUTCH-335:
--
I searched before but solution was not there alternative you have given and if
you know any pdf properties
Stefan Groschupf wrote:
Hi,
I have some code using queue based mechanism and java nio.
In my tests it is 4 times faster than the existing fetcher.
But:
+ I need to fix some more bugs
+ we need to re factor the robots.txt part since it is not usable
outside the http protocols yet.
IMO, also
[ http://issues.apache.org/jira/browse/NUTCH-318?page=all ]
Sami Siren resolved NUTCH-318.
--
Fix Version/s: 0.8.1
Resolution: Fixed
Assignee: Sami Siren
marking this as resolved because it is now working ok in single node config.
log4j not
[
http://issues.apache.org/jira/browse/NUTCH-266?page=comments#action_12424930 ]
Sami Siren commented on NUTCH-266:
--
just adding a remainder:
there are two options to get this fixed, use patched version of hadoop-0.4.0 or
wait until
Harvested links shouldn't get db.score.injected in addition to inbound
contributions
Key: NUTCH-336
URL: http://issues.apache.org/jira/browse/NUTCH-336
Project:
Fetcher ignores the fetcher.parse value configured in config file
-
Key: NUTCH-337
URL: http://issues.apache.org/jira/browse/NUTCH-337
Project: Nutch
Issue Type: Bug