[
http://issues.apache.org/jira/browse/NUTCH-354?page=comments#action_12429496 ]
Stefan Groschupf commented on NUTCH-354:
Since this issue is already closed I can not attach the patch file, so I attach
it as text within this comment.
If
[
http://issues.apache.org/jira/browse/NUTCH-356?page=comments#action_12429534 ]
Stefan Groschupf commented on NUTCH-356:
Hi Enrico,
there will be as much PluginRepositories as Configuration objects.
So in case you create many
crawling simulation
---
Key: NUTCH-357
URL: http://issues.apache.org/jira/browse/NUTCH-357
Project: Nutch
Issue Type: Improvement
Affects Versions: 0.8.1, 0.9.0
Reporter: Stefan Groschupf
Fix
[ http://issues.apache.org/jira/browse/NUTCH-357?page=all ]
Stefan Groschupf updated NUTCH-357:
---
Attachment: protocol-simulation-pluginV1.patch
A very first preview of a plugin that helps to simulate crawls. This protocol
plugin can be used to
MapWritable, nextEntry is not reset when Entries are recycled
---
Key: NUTCH-354
URL: http://issues.apache.org/jira/browse/NUTCH-354
Project: Nutch
Issue Type: Bug
Affects
[ http://issues.apache.org/jira/browse/NUTCH-354?page=all ]
Stefan Groschupf updated NUTCH-354:
---
Attachment: resetNextEntryInMapWritableV1.patch
Resets the next Entry of a recycled entry.
MapWritable, nextEntry is not reset when Entries are
[
http://issues.apache.org/jira/browse/NUTCH-343?page=comments#action_12428920 ]
Stefan Groschupf commented on NUTCH-343:
Thanks for the contribution, also that your patch has a test. :-)
Just a small comment from taking a first look to
[ http://issues.apache.org/jira/browse/NUTCH-341?page=all ]
Stefan Groschupf updated NUTCH-341:
---
Attachment: doNotDeleteTmpIndexMergeDirV1.patch
+1.
I agree it makes completly no sense to be required creating a tmp folder
manually and nutch deletes
[ http://issues.apache.org/jira/browse/NUTCH-337?page=all ]
Stefan Groschupf updated NUTCH-337:
---
Attachment: respectFetcherParsePropertyV1.patch
Hi Jeremy, thanks for catching this. Attached a fix. Should be easy for a
contributor to commit this to
[ http://issues.apache.org/jira/browse/NUTCH-337?page=all ]
Stefan Groschupf updated NUTCH-337:
---
Priority: Major (was: Trivial)
Fetcher ignores the fetcher.parse value configured in config file
urls blocked db.fetch.retry.max * http.max.delays times during fetching are
marked as STATUS_DB_GONE
--
Key: NUTCH-350
URL:
[ http://issues.apache.org/jira/browse/NUTCH-350?page=all ]
Stefan Groschupf updated NUTCH-350:
---
Attachment: protocolRetryV5.patch
This patch will dramatically increase the number of successfully fetched pages
of a intranet crawl over the time.
[
http://issues.apache.org/jira/browse/NUTCH-322?page=comments#action_12428858 ]
Stefan Groschupf commented on NUTCH-322:
I think this is a serious problem. Page A server side redirect to Page B. Page
A is never writen to the output.
pages that serverside forwards will be refetched every time
---
Key: NUTCH-353
URL: http://issues.apache.org/jira/browse/NUTCH-353
Project: Nutch
Issue Type: Bug
Affects Versions:
[ http://issues.apache.org/jira/browse/NUTCH-353?page=all ]
Stefan Groschupf updated NUTCH-353:
---
Attachment: doNotRefecthForwarderPagesV1.patch
Since we discussed that nutch need to be more polite we should fix that asap.
pages that serverside
[ http://issues.apache.org/jira/browse/NUTCH-322?page=all ]
Stefan Groschupf resolved NUTCH-322.
Resolution: Duplicate
duplicate of NUTCH-353
Fetcher discards ProtocolStatus, doesn't store redirected pages
[
http://issues.apache.org/jira/browse/NUTCH-347?page=comments#action_12428915 ]
Stefan Groschupf commented on NUTCH-347:
Please submit this patch!
Thanks!
Build: plugins' Jars not found
--
[
http://issues.apache.org/jira/browse/NUTCH-346?page=comments#action_12428917 ]
Stefan Groschupf commented on NUTCH-346:
+1
I agree, can you please create a patch file and attach it to this bug.
Thanks
Improve readability of
[
http://issues.apache.org/jira/browse/NUTCH-345?page=comments#action_12428918 ]
Stefan Groschupf commented on NUTCH-345:
Shouldn't the DeflateUtils also be part of the protocol-http plugin?
Also since it is a larger contribution and
[
http://issues.apache.org/jira/browse/NUTCH-349?page=comments#action_12428537 ]
Stefan Groschupf commented on NUTCH-349:
my vote goes to #2.
Having a tool that need to be started manually would be better than complicate
the already
[
http://issues.apache.org/jira/browse/NUTCH-233?page=comments#action_12428542 ]
Stefan Groschupf commented on NUTCH-233:
Hi Otis,
yes for a serious whole web crawl I need to change this reg ex first.
It only hangs with some random urls
[ http://issues.apache.org/jira/browse/NUTCH-348?page=all ]
Stefan Groschupf updated NUTCH-348:
---
Attachment: sortPatchV1.patch
What people think about this kind of solution?
Generator is building fetch list using *lowest* scoring URLs
doubling score causes by page internal anchors.
---
Key: NUTCH-332
URL: http://issues.apache.org/jira/browse/NUTCH-332
Project: Nutch
Issue Type: Bug
Affects Versions: 0.8-dev
[
http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423539 ]
Stefan Groschupf commented on NUTCH-318:
Yes this happens only in a distributed environment. Please also see my last
mail in the hadoop dev list. I think
[
http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423433 ]
Stefan Groschupf commented on NUTCH-318:
Shouldn't that be fixed in .8 since by today this tool just produce no output?!
log4j not proper configured,
[
http://issues.apache.org/jira/browse/NUTCH-233?page=comments#action_12423438 ]
Stefan Groschupf commented on NUTCH-233:
I think this should be fixed in .8 too, since everybody that does real whole
web crawl with over a 100 Mio pages
UrlFilters.java throws NPE in case urlfilter.order contains Filters that are
not in plugin.includes
---
Key: NUTCH-325
URL:
[ http://issues.apache.org/jira/browse/NUTCH-325?page=all ]
Stefan Groschupf updated NUTCH-325:
---
Attachment: UrlFiltersNPE.patch
A patch that uses a Arralist instead of an array and put only entries into the
list when the entry is not null. Means
[ http://issues.apache.org/jira/browse/NUTCH-323?page=all ]
Stefan Groschupf updated NUTCH-323:
---
Attachment: MapWritableCopyConstructor.patch
Attached patch add a copy constructor to the map writable and use it in the
CrawlDatum.set methode. However
db.score.link.internal and db.score.link.external are ignored
-
Key: NUTCH-324
URL: http://issues.apache.org/jira/browse/NUTCH-324
Project: Nutch
Issue Type: Improvement
[ http://issues.apache.org/jira/browse/NUTCH-324?page=all ]
Stefan Groschupf updated NUTCH-324:
---
Attachment: InternalAndExternalLinkScoreFactor.patch
Multiply the score of a page during distributeScoreToOutlink with
db.score.link.internal or
[ http://issues.apache.org/jira/browse/NUTCH-319?page=all ]
Stefan Groschupf resolved NUTCH-319.
Resolution: Won't Fix
Sorry, that is bogus since it is wriiten to the logging stream.
OPICScoringFilter should use logging API instead of
OPICScoringFilter should use logging API instead of printStackTrace
---
Key: NUTCH-319
URL: http://issues.apache.org/jira/browse/NUTCH-319
Project: Nutch
Issue Type: Bug
log4j not proper configured, readdb doesnt give any information
---
Key: NUTCH-318
URL: http://issues.apache.org/jira/browse/NUTCH-318
Project: Nutch
Type: Bug
Versions: 0.8-dev
Reporter:
[ http://issues.apache.org/jira/browse/NUTCH-289?page=all ]
Stefan Groschupf updated NUTCH-289:
---
Attachment: ipInCrawlDatumDraftV5.patch
Release Candidate 1 of this patch.
This patch contains:
+ add IP Address to CrawlDatum Version 5 (as byte[4])
+
[ http://issues.apache.org/jira/browse/NUTCH-289?page=all ]
Stefan Groschupf updated NUTCH-289:
---
Attachment: ipInCrawlDatumDraftV4.patch
Attached a patch that does only use any time 4 byte for the ip. Means we do
ignore ipv6. This save us a 4 byte in
java doc of CrawlDb is wrong
Key: NUTCH-302
URL: http://issues.apache.org/jira/browse/NUTCH-302
Project: Nutch
Type: Bug
Reporter: Stefan Groschupf
Priority: Trivial
Fix For: 0.8-dev
CrawlDb has the same java doc as
[ http://issues.apache.org/jira/browse/NUTCH-301?page=all ]
Stefan Groschupf updated NUTCH-301:
---
Attachment: CommonGramsCacheV1.patch
Cache HashMap COMMON_TERMS in configuration instance.
CommonGrams loads analysis.common.terms.file for each query
[
http://issues.apache.org/jira/browse/NUTCH-293?page=comments#action_12415171 ]
Stefan Groschupf commented on NUTCH-293:
Any comments? There was already a posting in the nutch agent mailing list,
where someone had banned nutch since nutch does not
[
http://issues.apache.org/jira/browse/NUTCH-293?page=comments#action_12415236 ]
Stefan Groschupf commented on NUTCH-293:
Hi Andrzej,
I agree but writing a queue based fetcher is a big step. I already have some
basic code (nio based).
Also I don't
[
http://issues.apache.org/jira/browse/NUTCH-258?page=comments#action_12414763 ]
Stefan Groschupf commented on NUTCH-258:
Scott,
I agree with you. However we need a clean patch to solve the problem, we can
not just comment things out of the code.
[ http://issues.apache.org/jira/browse/NUTCH-289?page=all ]
Stefan Groschupf updated NUTCH-289:
---
Attachment: ipInCrawlDatumDraftV1.patch
To keep the discussion alive attached a _first draft_ for storing the ip in the
crawlDatum for public discussion.
[ http://issues.apache.org/jira/browse/NUTCH-298?page=all ]
Stefan Groschupf updated NUTCH-298:
---
Summary: if a 404 for a robots.txt is returned a NPE is thrown (was: if a
404 for a robots.txt is returned no page is fetched at all from the host)
if a 404 for a robots.txt is returned no page is fetched at all from the host
-
Key: NUTCH-298
URL: http://issues.apache.org/jira/browse/NUTCH-298
Project: Nutch
Type: Bug
Reporter:
[ http://issues.apache.org/jira/browse/NUTCH-298?page=all ]
Stefan Groschupf updated NUTCH-298:
---
Attachment: fixNpeRobotRuleSet.patch
fix the npe in RobotRuleSet happen in case we use a empthy RuleSet
if a 404 for a robots.txt is returned no page is
[
http://issues.apache.org/jira/browse/NUTCH-282?page=comments#action_12414435 ]
Stefan Groschupf commented on NUTCH-282:
Is that related to host grouping we discussed? Can we in this case close this
bug?
Showing too few results on a page (Paging
[
http://issues.apache.org/jira/browse/NUTCH-286?page=comments#action_12414439 ]
Stefan Groschupf commented on NUTCH-286:
This is difficult to realize since the http error code is readed from response
in the fetcher and setted into the protocol
[
http://issues.apache.org/jira/browse/NUTCH-292?page=comments#action_12414443 ]
Stefan Groschupf commented on NUTCH-292:
+1, Can someone create a clean patch file?
OpenSearchServlet: OutOfMemoryError: Java heap space
[
http://issues.apache.org/jira/browse/NUTCH-291?page=comments#action_12414445 ]
Stefan Groschupf commented on NUTCH-291:
lastModified will be only indexed if you switch on the index-more plugin.
If you think you should change the way lastmodified
[
http://issues.apache.org/jira/browse/NUTCH-290?page=comments#action_12414448 ]
Stefan Groschupf commented on NUTCH-290:
If a parser throws an exeption:
Fetcher, 261:
try {
parse = this.parseUtil.parse(content);
parseStatus =
[ http://issues.apache.org/jira/browse/NUTCH-287?page=all ]
Stefan Groschupf closed NUTCH-287:
--
Resolution: Won't Fix
http://www.mail-archive.com/nutch-user%40lucene.apache.org/msg04696.html
Exception when searching with sort
[ http://issues.apache.org/jira/browse/NUTCH-284?page=all ]
Stefan Groschupf closed NUTCH-284:
--
Resolution: Won't Fix
Yes, I was missing index-basic.
NullPointerException during index
-
Key: NUTCH-284
[
http://issues.apache.org/jira/browse/NUTCH-284?page=comments#action_12414453 ]
Stefan Groschupf commented on NUTCH-284:
Please try discuss such things first in the user mailing list than open a
issue.
Maintaining the issue tracking is very time
[
http://issues.apache.org/jira/browse/NUTCH-281?page=comments#action_12414454 ]
Stefan Groschupf commented on NUTCH-281:
Can you submit a patch file?
cached.jsp: base-href needs to be outside comments
[
http://issues.apache.org/jira/browse/NUTCH-274?page=comments#action_12414457 ]
Stefan Groschupf commented on NUTCH-274:
Should we fix this in TextInputFormat of Hadoop to ignore emthy lines or in the
Injector?
Empty row in/at end of URL-list
[
http://issues.apache.org/jira/browse/NUTCH-290?page=comments#action_12414469 ]
Stefan Groschupf commented on NUTCH-290:
As far I understand the code, the next parser is only used if the previous
parser return with a unsuccessfully paring status.
[ http://issues.apache.org/jira/browse/NUTCH-286?page=all ]
Stefan Groschupf closed NUTCH-286:
--
Resolution: Won't Fix
I hope everybody agree with the statement: We can not detect http response
codes based on responded html content.
Prune the
support for Crawl-delay in Robots.txt
-
Key: NUTCH-293
URL: http://issues.apache.org/jira/browse/NUTCH-293
Project: Nutch
Type: Improvement
Components: fetcher
Versions: 0.8-dev
Reporter: Stefan Groschupf
[ http://issues.apache.org/jira/browse/NUTCH-293?page=all ]
Stefan Groschupf updated NUTCH-293:
---
Attachment: crawlDelayv1.patch
A frist darft of a crawl delay support for nutch. The problem I see is that in
case ip based delay is configured it can
[
http://issues.apache.org/jira/browse/NUTCH-289?page=comments#action_12413940 ]
Stefan Groschupf commented on NUTCH-289:
+1
Andrzej, I agree that lookup the ip in ParseOutputFormat would be the best as
Doug suggested.
The biggest problem nutch has
[
http://issues.apache.org/jira/browse/NUTCH-249?page=comments#action_12376477 ]
Stefan Groschupf commented on NUTCH-249:
I mean the Class and method naming isn't very well.
Blacklist or blocklist? Whitelist or positivivelist?
Does this answer the
Administration GUI
--
Key: NUTCH-251
URL: http://issues.apache.org/jira/browse/NUTCH-251
Project: Nutch
Type: Improvement
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Minor
Fix For: 0.8-dev
Having a web based
[ http://issues.apache.org/jira/browse/NUTCH-249?page=all ]
Stefan Groschupf updated NUTCH-249:
---
Attachment: blackWhiteListV2.patch
A concept tryout of black- white list filtering. I'm looking for beta tester
and improvement suggestions. (Especially
[ http://issues.apache.org/jira/browse/NUTCH-246?page=all ]
Stefan Groschupf updated NUTCH-246:
---
Attachment: injectWithCurTimeMapper.patch
setFetchTime moved to Mapper.
segment size is never as big as topN or crawlDB size in a distributed
segment size is never as big as topN or crawlDB size in a distributed
deployement
-
Key: NUTCH-246
URL: http://issues.apache.org/jira/browse/NUTCH-246
Project: Nutch
Type: Bug
robot parser to restrict.
-
Key: NUTCH-247
URL: http://issues.apache.org/jira/browse/NUTCH-247
Project: Nutch
Type: Bug
Components: fetcher
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Minor
Fix For:
[
http://issues.apache.org/jira/browse/NUTCH-233?page=comments#action_12370686 ]
Stefan Groschupf commented on NUTCH-233:
Sorry, I haven't such url since it happens until reducing a fetch. Reducing
provides no logging and map data will be deleted
wrong regular expression hang reduce process for ever
--
Key: NUTCH-233
URL: http://issues.apache.org/jira/browse/NUTCH-233
Project: Nutch
Type: Bug
Versions: 0.8-dev
Reporter: Stefan Groschupf
improved handling of plugin folder configuration
Key: NUTCH-229
URL: http://issues.apache.org/jira/browse/NUTCH-229
Project: Nutch
Type: Improvement
Reporter: Stefan Groschupf
Priority: Critical
Fix For:
[ http://issues.apache.org/jira/browse/NUTCH-229?page=all ]
Stefan Groschupf updated NUTCH-229:
---
Attachment: pluginFolder.patch
A patch to be able using relative path that are not in the classpath.
improved handling of plugin folder configuration
CrawlDb Filter tool
---
Key: NUTCH-226
URL: http://issues.apache.org/jira/browse/NUTCH-226
Project: Nutch
Type: Improvement
Reporter: Stefan Groschupf
Priority: Minor
A tool to filter a existing crawlDb
--
This message is automatically
[ http://issues.apache.org/jira/browse/NUTCH-226?page=all ]
Stefan Groschupf updated NUTCH-226:
---
Attachment: crawlDbFilter.patch
Patch with tool to filter a existing crawlDb. In any case backup your crawlDb
first.
CrawlDb Filter tool
[ http://issues.apache.org/jira/browse/NUTCH-222?page=all ]
Stefan Groschupf closed NUTCH-222:
--
Resolution: Fixed
Hi,
I guess it is a typo, try invertlinks in case the nutch script does not know
the command as in your case invertlink it tries to
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12367991 ]
Stefan Groschupf commented on NUTCH-204:
Jérôme,
After taking a look to the HitDetails object again - after a some time - I
notice I completely had overseen that there
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12368038 ]
Stefan Groschupf commented on NUTCH-204:
Yes that is a good idea. Thanks for getting this into the sources.
Cheers,
Stefan
multiple field values in HitDetails
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12367520 ]
Stefan Groschupf commented on NUTCH-204:
There is something I don't understand with this patch. The way Lucene manage
multi-valued fields is to have many mono-valued
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12367539 ]
Stefan Groschupf commented on NUTCH-204:
Woudn't you end with something very similar as it is now, having one key and
multiple values per key?
The Lucene Document
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12367552 ]
Stefan Groschupf commented on NUTCH-204:
Make sense, I see, thanks for the clarification.
multiple field values in HitDetails
---
checkstyle
--
Key: NUTCH-213
URL: http://issues.apache.org/jira/browse/NUTCH-213
Project: Nutch
Type: Improvement
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Minor
Adding checkstyle target to ant build file to support
[ http://issues.apache.org/jira/browse/NUTCH-213?page=all ]
Stefan Groschupf updated NUTCH-213:
---
Attachment: checkstyle.patch
checkstyle-all-4.1.jar
As part of my learning lesson 'whitespace' I added a checkstyle target to the
build
[
http://issues.apache.org/jira/browse/NUTCH-211?page=comments#action_12366645 ]
Stefan Groschupf commented on NUTCH-211:
Raghavendra, I'm not sure if I also close the linkDB reader, may be I missed
that. I will check this later today and may come
[ http://issues.apache.org/jira/browse/NUTCH-211?page=all ]
Stefan Groschupf updated NUTCH-211:
---
Attachment: closeable160206.patch
Now also closing linkdb reader and file system, thanks to Raghavendra.
FetchedSegments leave readers open
FetchedSegments leave readers open
---
Key: NUTCH-211
URL: http://issues.apache.org/jira/browse/NUTCH-211
Project: Nutch
Type: Bug
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Critical
Fix For: 0.8-dev
[
http://issues.apache.org/jira/browse/NUTCH-204?page=comments#action_12366472 ]
Stefan Groschupf commented on NUTCH-204:
Any improvment suggestions or negative comments? If not it would be great if
one with write access to the svn can commit this
[ http://issues.apache.org/jira/browse/NUTCH-211?page=all ]
Stefan Groschupf reassigned NUTCH-211:
--
Assign To: Stefan Groschupf
FetchedSegments leave readers open
--
Key: NUTCH-211
URL:
[ http://issues.apache.org/jira/browse/NUTCH-211?page=all ]
Stefan Groschupf updated NUTCH-211:
---
Attachment: closeFetchSegments.patch
NutchBean, FetchedSegments,FetchedSegments.Segment IndexSearcher and HitContent
now extends / implements the hadoop
[ http://issues.apache.org/jira/browse/NUTCH-192?page=all ]
Stefan Groschupf updated NUTCH-192:
---
Attachment: metadata08_02_06.patch
Doug, I'm afraid there is a missunderstanding or may be I just do not
understand your comments.
A plugin never need
multiple field values in HitDetails
---
Key: NUTCH-204
URL: http://issues.apache.org/jira/browse/NUTCH-204
Project: Nutch
Type: Improvement
Components: searcher
Versions: 0.8-dev
Reporter: Stefan Groschupf
Fix
[ http://issues.apache.org/jira/browse/NUTCH-204?page=all ]
Stefan Groschupf updated NUTCH-204:
---
Attachment: DetailGetValues070206.patch
Patch that adding getValues to HitDetails.
multiple field values in HitDetails
[
http://issues.apache.org/jira/browse/NUTCH-192?page=comments#action_12364788 ]
Stefan Groschupf commented on NUTCH-192:
That's true. In any case I don't wan't to store the class id map. Since if we
do that, you are right we can use strings.
What
[
http://issues.apache.org/jira/browse/NUTCH-192?page=comments#action_12364795 ]
Stefan Groschupf commented on NUTCH-192:
A perfect plan, I will do that so and commit a new patch. :)
THANKS!
meta data support for CrawlDatum
[ http://issues.apache.org/jira/browse/NUTCH-192?page=all ]
Stefan Groschupf updated NUTCH-192:
---
Attachment: metadata010206.patch
As discussed...
meta data support for CrawlDatum
Key: NUTCH-192
[
http://issues.apache.org/jira/browse/NUTCH-192?page=comments#action_12364683 ]
Stefan Groschupf commented on NUTCH-192:
Andrzej, Doug. I'm not sure if I understand you correct, do you suggest to have
string keys and values, or just string keys?
[
http://issues.apache.org/jira/browse/NUTCH-192?page=comments#action_12364699 ]
Stefan Groschupf commented on NUTCH-192:
* plus whatever it takes to put the class name-id mapping in the MapWritable
header (the mapping table): let's assume 40
[ http://issues.apache.org/jira/browse/NUTCH-192?page=all ]
Stefan Groschupf updated NUTCH-192:
---
Attachment: metadata310106.patch
Now 1 byte for the class type and the size of the type itself, this means we
can have only 2 byte keys and 2 byte values
[ http://issues.apache.org/jira/browse/NUTCH-192?page=all ]
Stefan Groschupf updated NUTCH-192:
---
Attachment: metadata300106.patch
Attached a first suggestion for a patch to adding meta data support into
crawlDatum.
In general I created a MapWritable
[
http://issues.apache.org/jira/browse/NUTCH-14?page=comments#action_12364401 ]
Stefan Groschupf commented on NUTCH-14:
---
I didn't see that anymore, but I didn't make any newer heavy load test. We may
can close this for now.
NullPointerException
[
http://issues.apache.org/jira/browse/NUTCH-59?page=comments#action_12364136 ]
Stefan Groschupf commented on NUTCH-59:
---
Nutch 0.8 is very different to 0.7 in the way it stores page data and
linkgraph. Therefore a reimplementation of meta data
[ http://issues.apache.org/jira/browse/NUTCH-127?page=all ]
Stefan Groschupf resolved NUTCH-127:
Resolution: Fixed
I guess it is solved, thanks. If able to reproduce it again I will just reopen
this or a new report.
Thanks!
uncorrect values
[
http://issues.apache.org/jira/browse/NUTCH-169?page=comments#action_12363116 ]
Stefan Groschupf commented on NUTCH-169:
Thanks, we will fix this in the beginning of next week.
remove static NutchConf
---
Key:
1 - 100 of 143 matches
Mail list logo