[
http://issues.apache.org/jira/browse/NUTCH-95?page=comments#action_12330113 ]
Piotr Kosiorowski commented on NUTCH-95:
I was renaming segments quite often so I would vote for reading the date from
the segment instead of using dir name.
[ http://issues.apache.org/jira/browse/NUTCH-89?page=all ]
Piotr Kosiorowski closed NUTCH-89:
--
Fix Version: 0.8-dev
0.7
Resolution: Fixed
Applied in trunk and 0.7 branch. Thanks.
parse-rss null pointer exception
[ http://issues.apache.org/jira/browse/NUTCH-99?page=all ]
Piotr Kosiorowski closed NUTCH-99:
--
Resolution: Fixed
Patch committed. Thanks Stefan.
ports are hardcoded or random
-
Key: NUTCH-99
[
http://issues.apache.org/jira/browse/NUTCH-148?page=comments#action_12361128 ]
Piotr Kosiorowski commented on NUTCH-148:
-
Do you have Cygwin installed?
Is 'df' working in your cygwin installation?
Do you run crawl from cygwin shell?
Nutch
[
http://issues.apache.org/jira/browse/NUTCH-148?page=comments#action_12361206 ]
Piotr Kosiorowski commented on NUTCH-148:
-
'df' command is required for NDFS operation so if you were not using NDFS in
0.7.1 and nutch shell scripts you were able to
[ http://issues.apache.org/jira/browse/NUTCH-148?page=all ]
Piotr Kosiorowski closed NUTCH-148:
---
Resolution: Invalid
org.apache.nutch.tools.CrawlTool throws error while doing deleteduplicates
[ http://issues.apache.org/jira/browse/NUTCH-147?page=all ]
Piotr Kosiorowski closed NUTCH-147:
---
Resolution: Invalid
The Cygwin requirement on Windows is listed in the Nutch tutorial. Please reopen if
the problem persists after running it from Cygwin
[ http://issues.apache.org/jira/browse/NUTCH-42?page=all ]
Piotr Kosiorowski closed NUTCH-42:
--
Fix Version: 0.7.2-dev
0.8-dev
Resolution: Fixed
OpenSearch implemented.
enhance search.jsp such that it can also returns XML
[
http://issues.apache.org/jira/browse/NUTCH-142?page=comments#action_12361492 ]
Piotr Kosiorowski commented on NUTCH-142:
-
Thanks. Fixed in 0.7 branch. Left open to fix it in trunk after cleaning up the trunk
JUnit test problems (in the next few days).
[
http://issues.apache.org/jira/browse/NUTCH-138?page=comments#action_12361520 ]
Piotr Kosiorowski commented on NUTCH-138:
-
I am not sure, but I suspect the problem is a bad Tomcat configuration.
To handle special characters in query urls
[ http://issues.apache.org/jira/browse/NUTCH-138?page=all ]
Piotr Kosiorowski closed NUTCH-138:
---
Resolution: Invalid
Setting URIEncoding in the Tomcat config file fixes the problem.
non-Latin-1 characters cannot be submitted for search
[
http://issues.apache.org/jira/browse/NUTCH-138?page=comments#action_12361549 ]
Piotr Kosiorowski commented on NUTCH-138:
-
BTW - just create a user for yourself in the Nutch Wiki and you should be able to add
a new page with information without
[ http://issues.apache.org/jira/browse/NUTCH-142?page=all ]
Piotr Kosiorowski closed NUTCH-142:
---
Fix Version: 0.7.2-dev
0.8-dev
Resolution: Fixed
NutchConf should use the thread context classloader
[ http://issues.apache.org/jira/browse/NUTCH-174?page=all ]
Piotr Kosiorowski closed NUTCH-174:
---
Fix Version: 0.7.2-dev
0.8-dev
Resolution: Fixed
Fixed some time ago during preparation of 0.7.2 release. Please use version
[ http://issues.apache.org/jira/browse/NUTCH-45?page=all ]
Piotr Kosiorowski closed NUTCH-45:
--
Fix Version: 0.7.2-dev
Resolution: Fixed
Applied. Thanks.
Log corrupt segments in SegmentMergeTool
[
http://issues.apache.org/jira/browse/NUTCH-79?page=comments#action_12364496 ]
Piotr Kosiorowski commented on NUTCH-79:
I think it should work without changes I suggested in previous comment - they
would be simply useful additions.
I was not using
[
http://issues.apache.org/jira/browse/NUTCH-225?page=comments#action_12369405 ]
Piotr Kosiorowski commented on NUTCH-225:
-
As stated in another thread I prefer to have a simple tutorial kept in version
control with releases.
We already have a
[ http://issues.apache.org/jira/browse/NUTCH-225?page=all ]
Piotr Kosiorowski closed NUTCH-225:
---
Resolution: Won't Fix
I have just updated the Nutch Web site. It now contains both tutorials (for 0.7 and
0.8).
I have also added a note to each
[ http://issues.apache.org/jira/browse/NUTCH-91?page=all ]
Piotr Kosiorowski closed NUTCH-91:
--
Fix Version: 0.7.2-dev
0.8-dev
Resolution: Fixed
Committed with a small extension. Thanks.
empty encoding causes exception
[ http://issues.apache.org/jira/browse/NUTCH-239?page=all ]
Piotr Kosiorowski closed NUTCH-239:
---
Fix Version: 0.7.2-dev
Resolution: Fixed
Assign To: Piotr Kosiorowski
Applied with JavaDoc changes. Thanks.
I changed httpclient to use
[ http://issues.apache.org/jira/browse/NUTCH-94?page=all ]
Piotr Kosiorowski closed NUTCH-94:
--
Fix Version: 0.7.2-dev
Resolution: Duplicate
Assign To: Piotr Kosiorowski
Duplicate of NUTCH-117.
MapFile.Writer throwing 'File exists
[ http://issues.apache.org/jira/browse/NUTCH-14?page=all ]
Piotr Kosiorowski closed NUTCH-14:
--
Resolution: Cannot Reproduce
Closed according to Stefan's suggestion
NullPointerException NutchBean.getSummary
[ http://issues.apache.org/jira/browse/NUTCH-117?page=all ]
Piotr Kosiorowski closed NUTCH-117:
---
Fix Version: 0.7.2-dev
Resolution: Fixed
Assign To: Piotr Kosiorowski
Applied fix by Mike. Also reported offlist by Michal Karwanski.
[ http://issues.apache.org/jira/browse/NUTCH-374?page=all ]
Piotr Kosiorowski reassigned NUTCH-374:
---
Assignee: Piotr Kosiorowski
when http.content.limit be set to -1 and Response.CONTENT_ENCODING is gzip
or x-gzip , it can not fetch any
[
https://issues.apache.org/jira/browse/NUTCH-429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Piotr Kosiorowski closed NUTCH-429.
---
Resolution: Invalid
Please use nutch-user mailing list for such questions and JIRA for