[jira] Updated: (NUTCH-251) Administration GUI

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-251?page=all ] Sami Siren updated NUTCH-251: - Fix Version/s: 0.9-dev (was: 0.8-dev) Administration GUI -- Key: NUTCH-251 URL:

[jira] Updated: (NUTCH-318) log4j not proper configured, readdb doesnt give any information

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-318?page=all ] Sami Siren updated NUTCH-318: - Fix Version/s: 0.9-dev (was: 0.8-dev) log4j not proper configured, readdb doesnt give any information

[jira] Updated: (NUTCH-322) Fetcher discards ProtocolStatus, doesn't store redirected pages

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-322?page=all ] Sami Siren updated NUTCH-322: - Fix Version/s: 0.9-dev (was: 0.8-dev) Fetcher discards ProtocolStatus, doesn't store redirected pages

[jira] Updated: (NUTCH-262) Summary excerpts and highlights problems

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-262?page=all ] Sami Siren updated NUTCH-262: - Fix Version/s: 0.9-dev (was: 0.8-dev) Summary excerpts and highlights problems

[jira] Updated: (NUTCH-233) wrong regular expression hang reduce process for ever

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-233?page=all ] Sami Siren updated NUTCH-233: - Fix Version/s: 0.9-dev (was: 0.8-dev) wrong regular expression hang reduce process for ever

[jira] Updated: (NUTCH-247) robot parser to restrict.

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-247?page=all ] Sami Siren updated NUTCH-247: - Fix Version/s: 0.9-dev (was: 0.8-dev) robot parser to restrict. - Key: NUTCH-247

[jira] Updated: (NUTCH-325) UrlFilters.java throws NPE in case urlfilter.order contains Filters that are not in plugin.includes

2006-07-25 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-325?page=all ] Sami Siren updated NUTCH-325: - Fix Version/s: 0.9-dev (was: 0.8-dev) UrlFilters.java throws NPE in case urlfilter.order contains Filters that are not in plugin.includes

[jira] Commented: (NUTCH-266) hadoop bug when doing updatedb

2006-07-23 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-266?page=comments#action_12422929 ] Sami Siren commented on NUTCH-266: -- I finally found the time to setup an environment with cygwin and try this out. I can confirm that the hadoop.jar version

[jira] Created: (NUTCH-327) bin/nutch setting of log path problems on cygwin

2006-07-23 Thread Sami Siren (JIRA)
bin/nutch setting of log path problems on cygwin Key: NUTCH-327 URL: http://issues.apache.org/jira/browse/NUTCH-327 Project: Nutch Issue Type: Bug Affects Versions: 0.8-dev

[jira] Resolved: (NUTCH-327) bin/nutch setting of log path problems on cygwin

2006-07-23 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-327?page=all ] Sami Siren resolved NUTCH-327. -- Resolution: Fixed bin/nutch setting of log path problems on cygwin Key: NUTCH-327

[jira] Created: (NUTCH-328) commons-cli-2.0-SNAPSHOT.jar provided with nutch is not compatible with jdk 1.4

2006-07-23 Thread Sami Siren (JIRA)
commons-cli-2.0-SNAPSHOT.jar provided with nutch is not compatible with jdk 1.4 --- Key: NUTCH-328 URL: http://issues.apache.org/jira/browse/NUTCH-328 Project: Nutch

[jira] Resolved: (NUTCH-328) commons-cli-2.0-SNAPSHOT.jar provided with nutch is not compatible with jdk 1.4

2006-07-23 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-328?page=all ] Sami Siren resolved NUTCH-328. -- Resolution: Fixed updated library commons-cli-2.0-SNAPSHOT.jar provided with nutch is not compatible with jdk 1.4

[jira] Commented: (NUTCH-293) support for Crawl-delay in Robots.txt

2006-07-18 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-293?page=comments#action_12421930 ] Sami Siren commented on NUTCH-293: -- perhaps instead of delay = crawlDelay 0 ? crawlDelay : serverDelay; we could do delay=Math.max(crawlDelay, serverDelay);

[jira] Created: (NUTCH-320) DmozParser does not output urls to stdout

2006-07-17 Thread Sami Siren (JIRA)
DmozParser does not output urls to stdout - Key: NUTCH-320 URL: http://issues.apache.org/jira/browse/NUTCH-320 Project: Nutch Issue Type: Bug Affects Versions: 0.8-dev Reporter: Sami

[jira] Resolved: (NUTCH-320) DmozParser does not output urls to stdout

2006-07-17 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-320?page=all ] Sami Siren resolved NUTCH-320. -- Resolution: Fixed DmozParser does not output urls to stdout - Key: NUTCH-320 URL:

[jira] Resolved: (NUTCH-172) Segment merger

2006-07-11 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-172?page=all ] Sami Siren resolved NUTCH-172: -- Fix Version: 0.8-dev Resolution: Fixed Assign To: Andrzej Bialecki this has allready been implemented by ab mergesegs Segment merger

[jira] Resolved: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem

2006-06-27 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-306?page=all ] Sami Siren resolved NUTCH-306: -- Fix Version: 0.8-dev Resolution: Fixed just committed this, thanks Grant! DistributedSearch.Client liveAddresses concurrency problem

[jira] Assigned: (NUTCH-110) OpenSearchServlet outputs illegal xml characters

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-110?page=all ] Sami Siren reassigned NUTCH-110: Assign To: Sami Siren OpenSearchServlet outputs illegal xml characters Key: NUTCH-110

[jira] Commented: (NUTCH-110) OpenSearchServlet outputs illegal xml characters

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-110?page=comments#action_12416932 ] Sami Siren commented on NUTCH-110: -- in method addAttribute(...) line: attribute.setValue(getLegalXml(getLegalXml(value))); intentional? OpenSearchServlet outputs illegal

[jira] Resolved: (NUTCH-302) java doc of CrawlDb is wrong

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-302?page=all ] Sami Siren resolved NUTCH-302: -- Resolution: Fixed Assign To: Sami Siren java doc of CrawlDb is wrong Key: NUTCH-302 URL:

[jira] Resolved: (NUTCH-166) secure jobtracker info pages with a password

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-166?page=all ] Sami Siren resolved NUTCH-166: -- Resolution: Won't Fix this is hadoop related secure jobtracker info pages with a password Key:

[jira] Resolved: (NUTCH-110) OpenSearchServlet outputs illegal xml characters

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-110?page=all ] Sami Siren resolved NUTCH-110: -- Fix Version: 0.8-dev Resolution: Fixed I just committed this with small changes (moved test to a test case) thanks. OpenSearchServlet outputs illegal

[jira] Resolved: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-292?page=all ] Sami Siren resolved NUTCH-292: -- Fix Version: 0.8-dev Resolution: Fixed Assign To: Sami Siren I just committed this, thank you! OpenSearchServlet: OutOfMemoryError: Java heap

[jira] Resolved: (NUTCH-156) nutch-daemon.sh should not overwrite old logs by default

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-156?page=all ] Sami Siren resolved NUTCH-156: -- Resolution: Won't Fix i quess the logging is now handled differently, so old logs sre not overwritten anymore nutch-daemon.sh should not overwrite old logs

[jira] Commented: (NUTCH-180) Performance problem with widely used keywords

2006-06-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-180?page=comments#action_12416979 ] Sami Siren commented on NUTCH-180: -- There's a naive caching implementation under contrib/web2/plugins wich one might try out and improve Performance problem with widely

[jira] Commented: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem

2006-06-18 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-306?page=comments#action_12416673 ] Sami Siren commented on NUTCH-306: -- This patch does not seem to apply anymore, can you please attach a patch against current svn trunk. DistributedSearch.Client

[jira] Assigned: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem

2006-06-15 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-306?page=all ] Sami Siren reassigned NUTCH-306: Assign To: Sami Siren DistributedSearch.Client liveAddresses concurrency problem -- Key:

[jira] Resolved: (NUTCH-122) block numbers need a better random number generator

2006-06-15 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-122?page=all ] Sami Siren resolved NUTCH-122: -- Resolution: Invalid this is more related to hadoop block numbers need a better random number generator ---

[jira] Closed: (NUTCH-187) Cannot start Nutch datanodes on Windows outside of a cygwin environment because of DF

2006-06-15 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-187?page=all ] Sami Siren closed NUTCH-187: Resolution: Won't Fix closed as requested Cannot start Nutch datanodes on Windows outside of a cygwin environment because of DF

[jira] Commented: (NUTCH-48) Did you mean query enhancement/refignment feature request

2006-06-06 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_12415016 ] Sami Siren commented on NUTCH-48: - stefan, I tried to apply your combined patch but it seems that the test case does not compile. Did you mean query enhancement/refignment

[jira] Resolved: (NUTCH-201) add support for subcollections

2006-06-05 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-201?page=all ] Sami Siren resolved NUTCH-201: -- Resolution: Fixed just committed this add support for subcollections -- Key: NUTCH-201 URL:

[jira] Resolved: (NUTCH-280) url query causes NullPointerException

2006-05-23 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-280?page=all ] Sami Siren resolved NUTCH-280: -- Fix Version: 0.8-dev Resolution: Fixed Assign To: Sami Siren fixed in trunk, thanks for reporting this url query causes NullPointerException

[jira] Resolved: (NUTCH-221) prepare nutch for upcoming lucene 2.0

2006-03-05 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-221?page=all ] Sami Siren resolved NUTCH-221: -- Resolution: Fixed committed prepare nutch for upcoming lucene 2.0 - Key: NUTCH-221 URL:

[jira] Updated: (NUTCH-221) prepare nutch for upcoming lucene 2.0

2006-03-03 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-221?page=all ] Sami Siren updated NUTCH-221: - Attachment: nutch-lucene-deprecation.txt prepare nutch for upcoming lucene 2.0 - Key: NUTCH-221 URL:

[jira] Resolved: (NUTCH-137) footer is not displayed in search result page

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-137?page=all ] Sami Siren resolved NUTCH-137: -- Fix Version: 0.8-dev Resolution: Fixed fixed as related to NUTCH-81 footer is not displayed in search result page

[jira] Closed: (NUTCH-123) Cache.jsp some times generate NullPointerException

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-123?page=all ] Sami Siren closed NUTCH-123: Fix Version: 0.8-dev Resolution: Duplicate problem reported to be fixed in NUTCH-135 Cache.jsp some times generate NullPointerException

[jira] Resolved: (NUTCH-64) no results after a restart of a search--server (without tomcat restart)

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-64?page=all ] Sami Siren resolved NUTCH-64: - Resolution: Duplicate duplicate with NUTCH-14 no results after a restart of a search--server (without tomcat restart)

[jira] Resolved: (NUTCH-90) reduce logging output of IndexSegment

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-90?page=all ] Sami Siren resolved NUTCH-90: - Resolution: Invalid doesn't seem to apply anymore reduce logging output of IndexSegment - Key: NUTCH-90

[jira] Resolved: (NUTCH-200) OpenSearch Servlet ist broken

2006-02-06 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-200?page=all ] Sami Siren resolved NUTCH-200: -- Fix Version: 0.8-dev Resolution: Fixed this is now fixed, thanks OpenSearch Servlet ist broken - Key: NUTCH-200

[jira] Assigned: (NUTCH-81) Webapp only works when deployed in root

2006-02-06 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-81?page=all ] Sami Siren reassigned NUTCH-81: --- Assign To: Sami Siren Webapp only works when deployed in root --- Key: NUTCH-81 URL:

[jira] Assigned: (NUTCH-178) in search.jsp must be session creation false

2006-02-03 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-178?page=all ] Sami Siren reassigned NUTCH-178: Assign To: Sami Siren in search.jsp must be session creation false -- Key: NUTCH-178 URL:

[jira] Created: (NUTCH-201) add support for subcollections

2006-02-03 Thread Sami Siren (JIRA)
add support for subcollections -- Key: NUTCH-201 URL: http://issues.apache.org/jira/browse/NUTCH-201 Project: Nutch Type: New Feature Versions: 0.8-dev Reporter: Sami Siren Assigned to: Sami Siren Priority: Minor

[jira] Updated: (NUTCH-201) add support for subcollections

2006-02-03 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-201?page=all ] Sami Siren updated NUTCH-201: - Attachment: subcollections-1.patch add support for subcollections -- Key: NUTCH-201 URL:

[jira] Commented: (NUTCH-193) move NDFS and MapReduce to a separate project

2006-01-31 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-193?page=comments#action_12364663 ] Sami Siren commented on NUTCH-193: -- +1 I quess the fuse-j - ndfs work from John/me could be part of hadoop /contrib after this change? move NDFS and MapReduce to a

[jira] Commented: (NUTCH-44) too many search results

2006-01-31 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-44?page=comments#action_12364679 ] Sami Siren commented on NUTCH-44: - Byron, have you made any progress with this? too many search results --- Key: NUTCH-44 URL:

[jira] Resolved: (NUTCH-146) mapred.job.tracker.info.port is defined 2 times in the nutch-default.xml

2005-12-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-146?page=all ] Sami Siren resolved NUTCH-146: -- Fix Version: 0.8-dev Resolution: Fixed Assign To: Sami Siren mapred.job.tracker.info.port is defined 2 times in the nutch-default.xml

[jira] Resolved: (NUTCH-145) build of war file fails on Chinese (zh) .xml files due to UTF-8 BOM

2005-12-20 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-145?page=all ] Sami Siren resolved NUTCH-145: -- Fix Version: 0.8-dev Resolution: Fixed Assign To: Sami Siren this is now committed, thanks build of war file fails on Chinese (zh) .xml files due

<    1   2   3   4   5