[jira] Resolved: (NUTCH-805) Unable to resolve the url-blah-blah, skipping

2010-06-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-805. - Fix Version/s: 1.2 Resolution: Incomplete - the amount of detail required to track

[jira] Commented: (NUTCH-834) Separate the Nutch web site from trunk

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12883141#action_12883141 ] Chris A. Mattmann commented on NUTCH-834: - Hey Julien: My recommendation would be

[jira] Updated: (NUTCH-363) Fetcher normalizes everything at least twice

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-363: Fix Version/s: 2.0 (was: 1.2) Fetcher normalizes everything at

[jira] Updated: (NUTCH-833) Website is still Lucene branded

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-833: Fix Version/s: 2.0 (was: 1.2) Website is still Lucene branded

[jira] Updated: (NUTCH-50) Benchmarks Performance goals

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-50?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-50: --- Fix Version/s: 2.0 (was: 1.2) Benchmarks Performance goals

[jira] Updated: (NUTCH-832) Website menu has lots of broken links - in particular the API docs

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-832: Fix Version/s: 2.0 (was: 1.2) Website menu has lots of broken links

[jira] Updated: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-831: Fix Version/s: 2.0 (was: 1.2) Allow configuration of how fields

[jira] Commented: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12883415#action_12883415 ] Chris A. Mattmann commented on NUTCH-831: - I applied this patch to the Nutch 1.2

[jira] Resolved: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized

2010-06-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-831. - Assignee: Chris A. Mattmann Fix Version/s: (was: 2.0) Resolution:

[jira] Commented: (NUTCH-774) Retry interval in crawl date is set to 0

2010-06-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12883541#action_12883541 ] Chris A. Mattmann commented on NUTCH-774: - Yep, it's still open Alex. I meant

[jira] Work started: (NUTCH-838) Add timing information to all Tool classes

2010-07-01 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-838 started by Chris A. Mattmann. Add timing information to all Tool classes -- Key: NUTCH-838

[jira] Commented: (NUTCH-837) Remove search servers and Lucene dependencies

2010-07-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884691#action_12884691 ] Chris A. Mattmann commented on NUTCH-837: - Hey Julien: How are we going to replace

[jira] Commented: (NUTCH-837) Remove search servers and Lucene dependencies

2010-07-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884712#action_12884712 ] Chris A. Mattmann commented on NUTCH-837: - I'm not sure I agree :) The Nutch

[jira] Commented: (NUTCH-837) Remove search servers and Lucene dependencies

2010-07-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884718#action_12884718 ] Chris A. Mattmann commented on NUTCH-837: - Hey Julien, Yep that's the point. Solr

[jira] Created: (NUTCH-841) Nutch 2.0 webapp

2010-07-02 Thread Chris A. Mattmann (JIRA)
Nutch 2.0 webapp Key: NUTCH-841 URL: https://issues.apache.org/jira/browse/NUTCH-841 Project: Nutch Issue Type: Improvement Components: web gui Environment: Nutch 2.0 Reporter: Chris A.

[jira] Commented: (NUTCH-837) Remove search servers and Lucene dependencies

2010-07-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12884731#action_12884731 ] Chris A. Mattmann commented on NUTCH-837: - Okey dok, I created NUTCH-841 to track

[jira] Updated: (NUTCH-838) Add timing information to all Tool classes

2010-07-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-838: Fix Version/s: 1.2 I'll backport this to the 1.2 branch as well. Add timing information

[jira] Resolved: (NUTCH-838) Add timing information to all Tool classes

2010-07-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-838. - Resolution: Fixed - Patch applied to trunk in r960246 and backported to 1.2-branch in

[jira] Commented: (NUTCH-696) Timeout for Parser

2010-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885290#action_12885290 ] Chris A. Mattmann commented on NUTCH-696: - Hey Guys, Why don't we flow this patch

[jira] Commented: (NUTCH-821) Use ivy in nutch builds

2010-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885291#action_12885291 ] Chris A. Mattmann commented on NUTCH-821: - Guys, Why have any libs in the lib dir

[jira] Commented: (NUTCH-696) Timeout for Parser

2010-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885299#action_12885299 ] Chris A. Mattmann commented on NUTCH-696: - I hear ya! Welp, +1 to commit, no

[jira] Commented: (NUTCH-696) Timeout for Parser

2010-07-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885313#action_12885313 ] Chris A. Mattmann commented on NUTCH-696: - Hey Ken: +1, please file a Tika issue

[jira] Commented: (NUTCH-821) Use ivy in nutch builds

2010-07-06 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885547#action_12885547 ] Chris A. Mattmann commented on NUTCH-821: - Hi Julien: I reviewed your patch, and am

[jira] Commented: (NUTCH-843) Separate the build and runtime environments

2010-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12885967#action_12885967 ] Chris A. Mattmann commented on NUTCH-843: - Super +1 I've wanted to do something

[jira] Commented: (NUTCH-843) Separate the build and runtime environments

2010-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12886012#action_12886012 ] Chris A. Mattmann commented on NUTCH-843: - Hey Andrzej: Wouldn't my proposed

[jira] Commented: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration

2010-07-13 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12887792#action_12887792 ] Chris A. Mattmann commented on NUTCH-848: - Hey Julien: +1, the trunk can't be

[jira] Commented: (NUTCH-853) Remove unused parameter files from conf/

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12888405#action_12888405 ] Chris A. Mattmann commented on NUTCH-853: - Hrm: I'm not sold on context.xsl being

[jira] Commented: (NUTCH-853) Remove unused parameter files from conf/

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12888415#action_12888415 ] Chris A. Mattmann commented on NUTCH-853: - Meh, potentially. I think having the XSL

[jira] Issue Comment Edited: (NUTCH-853) Remove unused parameter files from conf/

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12888415#action_12888415 ] Chris A. Mattmann edited comment on NUTCH-853 at 7/14/10 12:19 PM:

[jira] Assigned: (NUTCH-825) Publish nutch artifacts to central maven repository

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-825: --- Assignee: Chris A. Mattmann Publish nutch artifacts to central maven repository

[jira] Resolved: (NUTCH-759) Removal of deprecated APIs

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-759. - Assignee: Chris A. Mattmann Resolution: Incomplete This issue isn't clear at all.

[jira] Commented: (NUTCH-677) Segment merge filering based on segment content

2010-07-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12888530#action_12888530 ] Chris A. Mattmann commented on NUTCH-677: - Hi Marcin, I applied your patch, and was

[jira] Assigned: (NUTCH-871) MoreIndexingFilter missing date format

2010-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-871: --- Assignee: Chris A. Mattmann MoreIndexingFilter missing date format

[jira] Work started: (NUTCH-871) MoreIndexingFilter missing date format

2010-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-871 started by Chris A. Mattmann. MoreIndexingFilter missing date format -- Key: NUTCH-871

[jira] Commented: (NUTCH-863) Benchmark and a testbed proxy server

2010-08-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12894944#action_12894944 ] Chris A. Mattmann commented on NUTCH-863: - Hey Andrzej, Oddly enough, this test

[jira] Commented: (NUTCH-863) Benchmark and a testbed proxy server

2010-08-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895125#action_12895125 ] Chris A. Mattmann commented on NUTCH-863: - okey dok, I cleaned this up in r982102,

[jira] Commented: (NUTCH-858) No longer able to set per-field boosts on lucene documents

2010-08-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895126#action_12895126 ] Chris A. Mattmann commented on NUTCH-858: - Hey Andrzej, do you know what rev this

[jira] Commented: (NUTCH-863) Benchmark and a testbed proxy server

2010-08-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895259#action_12895259 ] Chris A. Mattmann commented on NUTCH-863: - Hey Guys: Just wanted to let you know

[jira] Commented: (NUTCH-858) No longer able to set per-field boosts on lucene documents

2010-08-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895819#action_12895819 ] Chris A. Mattmann commented on NUTCH-858: - +1! No longer able to set per-field

[jira] Resolved: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing.

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-855. - Resolution: Fixed My preference is that rather than reopen issues (which is a real pain

[jira] Commented: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing.

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896271#action_12896271 ] Chris A. Mattmann commented on NUTCH-855: - updated the docs with your new comments

[jira] Work started: (NUTCH-870) Injector should add the metadata before calling injectedScore

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-870 started by Chris A. Mattmann. Injector should add the metadata before calling injectedScore

[jira] Assigned: (NUTCH-870) Injector should add the metadata before calling injectedScore

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-870: --- Assignee: Chris A. Mattmann (was: Julien Nioche) Injector should add the metadata

[jira] Updated: (NUTCH-864) Fetcher generates entries with status 0

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-864: Fix Version/s: 2.0 (was: nutchbase) Affects Version/s:

[jira] Commented: (NUTCH-859) Diff trunk and NutchBase

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896293#action_12896293 ] Chris A. Mattmann commented on NUTCH-859: - note since I'm about to create branch-1.3

[jira] Work started: (NUTCH-873) Ivy configuration settings don't include Gora

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-873 started by Chris A. Mattmann. Ivy configuration settings don't include Gora - Key:

[jira] Created: (NUTCH-873) Ivy configuration settings don't include Gora

2010-08-07 Thread Chris A. Mattmann (JIRA)
Ivy configuration settings don't include Gora - Key: NUTCH-873 URL: https://issues.apache.org/jira/browse/NUTCH-873 Project: Nutch Issue Type: Bug Components: build Environment:

[jira] Resolved: (NUTCH-873) Ivy configuration settings don't include Gora

2010-08-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-873. - Resolution: Fixed - fixed in r983322. Ivy configuration settings don't include Gora

[jira] Resolved: (NUTCH-564) External parser supports encoding attribute

2010-08-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-564. - Fix Version/s: 2.0 Resolution: Fixed - patch applied in r983472. Thanks Antony!

[jira] Created: (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2010-08-08 Thread Chris A. Mattmann (JIRA)
Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora -- Key: NUTCH-874 URL: https://issues.apache.org/jira/browse/NUTCH-874 Project: Nutch Issue

[jira] Commented: (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2010-08-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896552#action_12896552 ] Chris A. Mattmann commented on NUTCH-874: - Hey Julien, I think Jukka already worked

[jira] Commented: (NUTCH-878) ScoringFilters should not override the injected score

2010-08-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896879#action_12896879 ] Chris A. Mattmann commented on NUTCH-878: - +1 from me, Julien, thanks!

[jira] Commented: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

2010-08-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896889#action_12896889 ] Chris A. Mattmann commented on NUTCH-877: - +1 from me too on this Dennis. Commit

[jira] Commented: (NUTCH-650) Hbase Integration

2010-08-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896939#action_12896939 ] Chris A. Mattmann commented on NUTCH-650: - +1, this should be wrapped up. Hbase

[jira] Commented: (NUTCH-811) Develop an ORM framework

2010-08-11 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897322#action_12897322 ] Chris A. Mattmann commented on NUTCH-811: - +1, close it out... Develop an ORM

[jira] Commented: (NUTCH-881) Good quality documentation for Nutch

2010-08-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12898052#action_12898052 ] Chris A. Mattmann commented on NUTCH-881: - {quote} I'm happy to learn about this and

[jira] Commented: (NUTCH-887) Delegate parsing of feeds to Tika

2010-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12898706#action_12898706 ] Chris A. Mattmann commented on NUTCH-887: - bq. Ah, good - I missed that, I need to

[jira] Commented: (NUTCH-891) Nutch build should not depend on unversioned local deps

2010-08-19 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12900283#action_12900283 ] Chris A. Mattmann commented on NUTCH-891: - Hi Andrzej: Can I get some

[jira] Commented: (NUTCH-891) Nutch build should not depend on unversioned local deps

2010-08-19 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12900290#action_12900290 ] Chris A. Mattmann commented on NUTCH-891: - bq. Sure, that would solve the problem

[jira] Commented: (NUTCH-891) Nutch build should not depend on unversioned local deps

2010-08-24 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12901953#action_12901953 ] Chris A. Mattmann commented on NUTCH-891: - +1. Great patch, Enis, I think we can use

[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable

2010-09-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907275#action_12907275 ] Chris A. Mattmann commented on NUTCH-407: - Hmmm: I agree here. If no one objects in

[jira] Created: (NUTCH-905) Configurable file protocol parent directory crawling

2010-09-10 Thread Chris A. Mattmann (JIRA)
Configurable file protocol parent directory crawling Key: NUTCH-905 URL: https://issues.apache.org/jira/browse/NUTCH-905 Project: Nutch Issue Type: Improvement Components:

[jira] Work started: (NUTCH-905) Configurable file protocol parent directory crawling

2010-09-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-905 started by Chris A. Mattmann. Configurable file protocol parent directory crawling

[jira] Resolved: (NUTCH-905) Configurable file protocol parent directory crawling

2010-09-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-905. - Resolution: Fixed - patch for NUTCH-407 applied to 2.0 trunk in r996045, and in

[jira] Commented: (NUTCH-882) Design a Host table in GORA

2010-09-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909727#action_12909727 ] Chris A. Mattmann commented on NUTCH-882: - Hey Doğacan: +1 to introducing a

[jira] Work started: (NUTCH-908) Infinite Loop and Null Pointer Bugs in Searching

2010-09-16 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-908 started by Chris A. Mattmann. Infinite Loop and Null Pointer Bugs in Searching Key:

[jira] Resolved: (NUTCH-908) Infinite Loop and Null Pointer Bugs in Searching

2010-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-908. - Resolution: Fixed - applied to branch-1.2 in r998587. Thanks Dennis! I'll roll a new RC

[jira] Updated: (NUTCH-909) Add alternative search-provider to Nutch site

2010-09-20 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-909: Fix Version/s: 2.0 - set fix version Add alternative search-provider to Nutch site

[jira] Assigned: (NUTCH-909) Add alternative search-provider to Nutch site

2010-09-20 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-909: --- Assignee: Chris A. Mattmann Add alternative search-provider to Nutch site

[jira] Assigned: (NUTCH-901) Make index-more plug-in configurable

2010-09-20 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-901: --- Assignee: Chris A. Mattmann Make index-more plug-in configurable

[jira] Updated: (NUTCH-901) Make index-more plug-in configurable

2010-09-20 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-901: Fix Version/s: 1.2 - fix for 1.2 as well (sigh, this means *another* RC). Oh well, for the

[jira] Resolved: (NUTCH-901) Make index-more plug-in configurable

2010-09-20 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-901. - Resolution: Fixed - patch applied to trunk in r999181 and to branch-1.2 in r999200.

[jira] Resolved: (NUTCH-577) Use explicit tika-config.xml file to enable mime magic detection to be turned on and off

2010-09-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-577. - Fix Version/s: 2.0 Resolution: Fixed - it's been 3 years since I've reported this

[jira] Updated: (NUTCH-910) Cached.jsp has a bug with encoding

2010-09-27 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-910: Fix Version/s: (was: 1.0.0) unset fix version -- 1.0.0 has already been released and

[jira] Commented: (NUTCH-921) Reduce dependency of Nutch on config files

2010-10-19 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12922616#action_12922616 ] Chris A. Mattmann commented on NUTCH-921: - Hey Andrzej: see NUTCH-431: super +1 on

[jira] Work started: (NUTCH-714) Need a SFTP and SCP Protocol Handler

2010-10-23 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-714 started by Chris A. Mattmann. Need a SFTP and SCP Protocol Handler Key: NUTCH-714

[jira] Resolved: (NUTCH-73) A page for CSV results

2010-10-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-73. Resolution: Won't Fix With SOLR-1925, we get this same functionality for free. Thanks for

[jira] Resolved: (NUTCH-825) Publish nutch artifacts to central maven repository

2010-10-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-825. - Resolution: Fixed - fix committed to trunk in r1028294. The fix includes the following

[jira] Issue Comment Edited: (NUTCH-825) Publish nutch artifacts to central maven repository

2010-10-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12925811#action_12925811 ] Chris A. Mattmann edited comment on NUTCH-825 at 10/28/10 9:55 AM:

[jira] [Commented] (NUTCH-386) Plugin to index categories by url rules

2011-04-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13020556#comment-13020556 ] Chris A. Mattmann commented on NUTCH-386: - Hi Richard, Thanks, would love to have

[jira] [Resolved] (NUTCH-984) Parse-tika throws some URL's away

2011-04-22 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-984. - Resolution: Won't Fix Looks like this is a Tika issue. If not, please let someone know or

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-05-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034761#comment-13034761 ] Chris A. Mattmann commented on NUTCH-995: - Step 1 - woot - patch applies

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-05-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034760#comment-13034760 ] Chris A. Mattmann commented on NUTCH-995: - Hey Julien, I'll test this today and

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-05-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034762#comment-13034762 ] Chris A. Mattmann commented on NUTCH-995: - OK, I'm getting an error when running

[jira] [Work started] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-995 started by Chris A. Mattmann. Generate POM file using the Ivy makepom task - Key:

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044339#comment-13044339 ] Chris A. Mattmann commented on NUTCH-995: - Latest patch from Julien fails on the

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044343#comment-13044343 ] Chris A. Mattmann commented on NUTCH-995: - Thanks Jul, that helps. I'll try and

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044346#comment-13044346 ] Chris A. Mattmann commented on NUTCH-995: - Added mvn.template in r1131455. Working

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044350#comment-13044350 ] Chris A. Mattmann commented on NUTCH-995: - Finishing touches applied in r1131458. I

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044547#comment-13044547 ] Chris A. Mattmann commented on NUTCH-995: - Hi Gabriele, Hmm...If you have a look

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044548#comment-13044548 ] Chris A. Mattmann commented on NUTCH-995: - Looks like there was a fairly recent IVY

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044579#comment-13044579 ] Chris A. Mattmann commented on NUTCH-995: - Hi Gabriele, thanks. Nutch is not

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044581#comment-13044581 ] Chris A. Mattmann commented on NUTCH-995: - bq. Sure, it's just a wish for standards

[jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044607#comment-13044607 ] Chris A. Mattmann commented on NUTCH-995: - I've gone ahead and created a wiki page

[jira] [Commented] (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool

2011-08-06 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080462#comment-13080462 ] Chris A. Mattmann commented on NUTCH-666: - Hi Lewis, That's fine with me. My

[jira] [Commented] (NUTCH-940) static field plugin

2011-09-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13102154#comment-13102154 ] Chris A. Mattmann commented on NUTCH-940: - Thumbs up. Please commit! No need to

[jira] [Created] (NUTCH-1526) Create SegmentContentDumperTool for easily extracting out file contents from SegmentDirs

2013-01-27 Thread Chris A. Mattmann (JIRA)
Chris A. Mattmann created NUTCH-1526: Summary: Create SegmentContentDumperTool for easily extracting out file contents from SegmentDirs Key: NUTCH-1526 URL: https://issues.apache.org/jira/browse/NUTCH-1526

[jira] [Updated] (NUTCH-1526) Create SegmentContentDumperTool for easily extracting out file contents from SegmentDirs

2013-01-27 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-1526: - Description: It only took me 1.2 years, but I finally got around to it. This patch will

[jira] [Created] (NUTCH-1539) Implement the Hypertext Induced Topic Search (HITS) algorithm in Nutch

2013-03-04 Thread Chris A. Mattmann (JIRA)
Chris A. Mattmann created NUTCH-1539: Summary: Implement the Hypertext Induced Topic Search (HITS) algorithm in Nutch Key: NUTCH-1539 URL: https://issues.apache.org/jira/browse/NUTCH-1539

[jira] [Work started] (NUTCH-1539) Implement the Hypertext Induced Topic Search (HITS) algorithm in Nutch

2013-03-04 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-1539 started by Chris A. Mattmann. Implement the Hypertext Induced Topic Search (HITS) algorithm in Nutch

  1   2   3   4   5   6   7   >