[jira] Commented: (NUTCH-335) Pdf summary corrupt issue

2006-08-01 Thread Siddharudh nadgeri (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-335?page=comments#action_12424855 ] Siddharudh nadgeri commented on NUTCH-335: -- I searched before but solution was not there alternative you have given and if you know any pdf properties

fetcher improvements (was: Re: 0.8 much slower than 0.7)

2006-08-01 Thread Sami Siren
Stefan Groschupf wrote: Hi, I have some code using queue based mechanism and java nio. In my tests it is 4 times faster than the existing fetcher. But: + I need to fix some more bugs + we need to re factor the robots.txt part since it is not usable outside the http protocols yet. IMO, also

[jira] Resolved: (NUTCH-318) log4j not proper configured, readdb doesnt give any information

2006-08-01 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-318?page=all ] Sami Siren resolved NUTCH-318. -- Fix Version/s: 0.8.1 Resolution: Fixed Assignee: Sami Siren marking this as resolved because it is now working ok in single node config. log4j not

[jira] Commented: (NUTCH-266) hadoop bug when doing updatedb

2006-08-01 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-266?page=comments#action_12424930 ] Sami Siren commented on NUTCH-266: -- just adding a remainder: there are two options to get this fixed, use patched version of hadoop-0.4.0 or wait until

[jira] Created: (NUTCH-336) Harvested links shouldn't get db.score.injected in addition to inbound contributions

2006-08-01 Thread Chris Schneider (JIRA)
Harvested links shouldn't get db.score.injected in addition to inbound contributions Key: NUTCH-336 URL: http://issues.apache.org/jira/browse/NUTCH-336 Project:

[jira] Created: (NUTCH-337) Fetcher ignores the fetcher.parse value configured in config file

2006-08-01 Thread Jeremy Huylebroeck (JIRA)
Fetcher ignores the fetcher.parse value configured in config file - Key: NUTCH-337 URL: http://issues.apache.org/jira/browse/NUTCH-337 Project: Nutch Issue Type: Bug