dated issues in JIRA

2011-06-07 Thread lewis john mcgibbney
Hi, I'm trying to get an idea of the type of issues which are currently being addressed and was looking through trivial issues in JIRA. There appear to be outstanding items from 2005, 2008 e.g. nutch-62 nutch-623 etc... I'm assuming that these aren't being assigned as they are of no interest to

[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-06-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13045524#comment-13045524 ] Lewis John McGibbney commented on NUTCH-623: Having checked branch-1.3

Re: dated issues in JIRA

2011-06-07 Thread lewis john mcgibbney
to all On Tue, Jun 7, 2011 at 6:22 PM, lewis john mcgibbney lewis.mcgibb...@gmail.com wrote: OK I will begin a basic clear out of redundant issues when my transition to project group status for JIRA is approved. Thanks On Tue, Jun 7, 2011 at 5:13 PM, Markus Jelsma markus.jel

Updating Wiki entries

2011-06-08 Thread lewis john mcgibbney
Hi, As 1.3 has just been released :0) I thought it appropriate to make an effort to upgrade some documentation in an attempt to drag parts of the wiki into the 1.3 era. In particular the following link is broken [1](404) The eclipse tutorial [2] needs updating slightly to accommodate new

Re: new branch 1.4 and possible features

2011-06-13 Thread lewis john mcgibbney
On Fri, Jun 10, 2011 at 12:11 PM, Markus Jelsma markus.jel...@openindex.io wrote: Guys, I added a new label 1.4 on the JIRA. Shall we create a new branch 1.4 on SVN from the existing 1.3? I agree that it is a pain to have to maintain 1.x AND trunk in parallel but my feeling is that 2.0

[jira] [Commented] (NUTCH-802) Problems managing outlinks with large url length

2011-06-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13052293#comment-13052293 ] Lewis John McGibbney commented on NUTCH-802: From recent user list

[jira] [Commented] (NUTCH-1000) Add option not to commit to Solr

2011-06-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054132#comment-13054132 ] Lewis John McGibbney commented on NUTCH-1000: - Hi Markus, I'm not on a work

[jira] [Updated] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1019: Summary: Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

[jira] [Created] (NUTCH-1019) Edit comment in org.apache.nutc.crawl.Crawl to reflect removal of legacy

2011-06-27 Thread Lewis John McGibbney (JIRA)
Type: Improvement Components: documentation Affects Versions: 1.4, 2.0 Reporter: Lewis John McGibbney Priority: Trivial Fix For: 1.4, 2.0 When updating the wiki documentation for command line options, I noticed that the comment on line 51

[jira] [Created] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-28 Thread Lewis John McGibbney (JIRA)
Type: Task Components: linkdb Affects Versions: 1.3, 1.4, 2.0 Reporter: Lewis John McGibbney Fix For: 1.4, 2.0 Whilst updating the CommandLineOptions for release 1.3 on the wiki, I noticed that the above class does not exist in the expected location in /src

[jira] [Commented] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056344#comment-13056344 ] Lewis John McGibbney commented on NUTCH-1020: - I tagged this as linkdb (which

[jira] [Commented] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056715#comment-13056715 ] Lewis John McGibbney commented on NUTCH-1019: - Yes I will do when I get home

[jira] [Created] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader

2011-06-28 Thread Lewis John McGibbney (JIRA)
: Improvement Components: linkdb Affects Versions: 1.3 Reporter: Lewis John McGibbney Priority: Trivial Fix For: 1.4, 2.0 The following line in the above class has a trivial error in syntax before the -dump parameter. Instead of a curly bracket

[jira] [Commented] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056750#comment-13056750 ] Lewis John McGibbney commented on NUTCH-1023: - I will submit a patch

Re: Create separate issues for 2.0?

2011-06-30 Thread lewis john mcgibbney
To reply to your original point Markus, I agree with your suggestion. It occurred to me recently that although many smaller issues apply to both branch 1.4 and trunk 2.0, the methods required to implement them on many occasions are different from a coding perspective therefore require different

[jira] [Commented] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-30 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13058075#comment-13058075 ] Lewis John McGibbney commented on NUTCH-1020: - I think you are correct Markus

Nutch 2.0 roadmap

2011-07-01 Thread lewis john mcgibbney
Hi, This is to all devs although I am referring to Julien (as he established/last edited the wiki page) Currently the slightly (in places) dated roadmap can be found here [1], I was wondering if we could give this an overhaul/update as it would give a more robust overview of where trunk is

[jira] [Commented] (NUTCH-628) Host database to keep track of host-level information

2011-07-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13058993#comment-13058993 ] Lewis John McGibbney commented on NUTCH-628: From previous discussion

Rebuilding site

2011-07-07 Thread lewis john mcgibbney
Hi, As I am back home I propose to rebuild the site to link the current tutorial link to the new 1.3 tutorial on the wiki. I would also like to formally make my first commit by adding my name to the list of committers before I progress with other bits and pieces. Julien, I managed to pick out

Re: Rebuilding site

2011-07-07 Thread lewis john mcgibbney
Thanks Julien, I didn't even see this ticket. I'm on it. One further question, it would be interesting to unearth why people are subscribing to the nutch-user@ list. I am aware that this was the old list when Nutch was a subpart of Lucene. There is a heavily weighted tendency for people to cross

[jira] [Commented] (NUTCH-1043) Add pattern for filtering .js in default url filters

2011-07-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13064045#comment-13064045 ] Lewis John McGibbney commented on NUTCH-1043: - I think some discussion

Nutch benchmark results

2011-07-12 Thread lewis john mcgibbney
Hi, This is of trivial importance, but would be nice to display at forthcoming presentations/conferences etc. It concerns the following wiki entry [1], which I can only assume was a benchmark for a specific production implementation of Nutch. Unfortunately I am not able to access the .pdf,

[jira] [Commented] (NUTCH-1054) Make linkDB optional during indexing

2011-07-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066143#comment-13066143 ] Lewis John McGibbney commented on NUTCH-1054: - Just catching up on this one

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066173#comment-13066173 ] Lewis John McGibbney commented on NUTCH-1048: - This affects more than one link

[jira] [Resolved] (NUTCH-916) Project Naming And Descriptions

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-916. Resolution: Fixed Assignee: Lewis John McGibbney Fixed as per ASF

[jira] [Closed] (NUTCH-915) project website basics

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-915. -- project website basics -- Key: NUTCH-915

[jira] [Closed] (NUTCH-917) Website Navigation Links

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-917. -- Website Navigation Links Key: NUTCH-917

[jira] [Assigned] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1019: --- Assignee: Lewis John McGibbney Edit comment in org.apache.nutch.crawl.Crawl

[jira] [Updated] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1019: Attachment: crawl-comment.patch Patch to address the trivial task

Does i18n have a purpose anymore

2011-07-16 Thread lewis john mcgibbney
Hi, As above, I am not sure about the requirement for this internationalisation section on the wiki anymore... we are no longer viewing Nutch search pages other than the .jsp search pages shipped with <= Nutch 1.2. Any views? -- *Lewis*

[jira] [Assigned] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-672: -- Assignee: Lewis John McGibbney allow unit tests to be run from bin/nutch

[jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066426#comment-13066426 ] Lewis John McGibbney commented on NUTCH-657: I have been unsuccessful

[jira] [Created] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
Components: documentation Affects Versions: 1.3 Reporter: Lewis John McGibbney Assignee: Lewis John McGibbney Priority: Minor Fix For: 1.4, 2.0 package.html within the language identifier plugin contains the following... however the link is broken

[jira] [Updated] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1055: Attachment: NUTCH-1055-package-html.patch patch attached to update relative URL

[jira] [Updated] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1055: Attachment: europarl.ps the attached document is referred to in package.html

Re: Does i18n have a purpose anymore

2011-07-16 Thread lewis john mcgibbney
I propose that we remove this from the main site and archive it to the legacy section of the wiki unless there are any objections. I will create a new JIRA ticket and address it in due course. Thanks On Sat, Jul 16, 2011 at 1:52 PM, Markus Jelsma markus.jel...@openindex.io wrote: I can't think

[jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066467#comment-13066467 ] Lewis John McGibbney commented on NUTCH-657: I opened a separate issue

[jira] [Created] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-07-16 Thread Lewis John McGibbney (JIRA)
: documentation Affects Versions: 1.4, 2.0 Reporter: Lewis John McGibbney Priority: Minor Fix For: 1.4, 2.0 It is important that we have a comprehensive plugin example for the current release of Nutch as packages and some classes have changed enough to create

[jira] [Closed] (NUTCH-16) boost documents matching a url pattern

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-16?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-16. - Resolution: Won't Fix Assignee: Lewis John McGibbney (was: Dennis Kubes) Agreed

Running individual test classes from nutch script cont'd

2011-07-17 Thread lewis john mcgibbney
Hi, OK this stems from discussion on the user@ list a while ago [1] and my discovery of NUTCH-672 yesterday. I attached a patch, which fails completely, as I hadn't uncovered things I now know. The original patch submitted for the issue would have been fine for <= Nutch 1.2 but now as the file

adding details to mvn.template?

2011-07-17 Thread lewis john mcgibbney
Hi, Quick question, I've been looking at various issues dealt with prior to Nutch 1.3 release in particular NUTCH-995. Please excuse (and correct) my ignorance, but I need to clear this one up so I understand correctly. The purpose the mvn.template file serves is so we can specify exactly who

[jira] [Commented] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066731#comment-13066731 ] Lewis John McGibbney commented on NUTCH-1019: - Committed at revision 1147712

[jira] [Updated] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1059: Attachment: NUTCH-1059-remove-convdb.patch The patch simply removes both

[jira] [Closed] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1059. --- Committed and closed at revision 1147813 Remove convdb command from /bin/nutch

[jira] [Resolved] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1059. - Resolution: Fixed Fix Version/s: (was: 2.0) Remove convdb command

[jira] [Resolved] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1055. - Resolution: Fixed upgrade package.html file in language identifier plugin

[jira] [Updated] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-672: --- Priority: Minor (was: Trivial) allow unit tests to be run from bin/nutch

[jira] [Closed] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1055. --- Committed in revision 1147817 (trunk) Committed in revision 1147818 (branch-1.4

changing file and directory names

2011-07-18 Thread lewis john mcgibbney
Hi, We currently have two trivial issues e.g. NUTCH-657 https://issues.apache.org/jira/browse/NUTCH-657 and NUTCH-623 https://issues.apache.org/jira/browse/NUTCH-623 both related at an abstract level to the same thing. They concern the name change of a file and plugin directory respectively. It would

[jira] [Commented] (NUTCH-1049) Add classes to bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066912#comment-13066912 ] Lewis John McGibbney commented on NUTCH-1049: - I would be happy to add

[jira] [Resolved] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1020. - Resolution: Fixed Fix Version/s: (was: 2.0) Assignee: Lewis

[jira] [Closed] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1020. --- Fixed and committed as NUTCH-1059 Remove convdb command from /bin/nutch (lewismc

[jira] [Commented] (NUTCH-881) Good quality documentation for Nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066919#comment-13066919 ] Lewis John McGibbney commented on NUTCH-881: What is the current state

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066927#comment-13066927 ] Lewis John McGibbney commented on NUTCH-865: My feelings are that this could

[jira] [Commented] (NUTCH-910) Cached.jsp has a bug with encoding

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066928#comment-13066928 ] Lewis John McGibbney commented on NUTCH-910: Mmmm... can we mark this as won't

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067139#comment-13067139 ] Lewis John McGibbney commented on NUTCH-1048: - Committed at revision 1147969

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067217#comment-13067217 ] Lewis John McGibbney commented on NUTCH-920: A new file should be created

[jira] [Assigned] (NUTCH-920) Project Metadata

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-920: -- Assignee: Lewis John McGibbney Project Metadata

Re: Running individual test classes from nutch script cont'd

2011-07-19 Thread lewis john mcgibbney
didn't manage to get it running either. I've also trouble finding the test case class. bin/nutch junit.textui.TestRunner org.apache.nutch.parse.TestOutlinkExtractor Won't find the test class. Seem obvious but i've no idea how to run it from the /src/. On Sunday 17 July 2011 15:06:26 lewis john

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067883#comment-13067883 ] Lewis John McGibbney commented on NUTCH-1048: - Thanks for this Julien

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067893#comment-13067893 ] Lewis John McGibbney commented on NUTCH-865: I'm happy to have a crack

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067950#comment-13067950 ] Lewis John McGibbney commented on NUTCH-865: agreed :0) Format source code

[jira] [Updated] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-920: --- Attachment: doap_Apache_Nutch.rdf DOAP attachment. It does not contain any

[jira] [Commented] (NUTCH-919) Logos and Graphics

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13068995#comment-13068995 ] Lewis John McGibbney commented on NUTCH-919: So it looks like a new image

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069056#comment-13069056 ] Lewis John McGibbney commented on NUTCH-920: Committed @ revision 1149263

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069156#comment-13069156 ] Lewis John McGibbney commented on NUTCH-920: yes Julien I'll get it committed

[jira] [Updated] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-920: --- Attachment: doap_Nutch_trunk.rdf DOAP file for Nutch 2.0 (trunk). Release date has

[jira] [Commented] (NUTCH-919) Logos and Graphics

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069453#comment-13069453 ] Lewis John McGibbney commented on NUTCH-919: sorted and committed @ revision

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069462#comment-13069462 ] Lewis John McGibbney commented on NUTCH-920: If deemed suitable I could commit

[jira] [Created] (NUTCH-1065) New mvn.template

2011-07-22 Thread Lewis John McGibbney (JIRA)
New mvn.template Key: NUTCH-1065 URL: https://issues.apache.org/jira/browse/NUTCH-1065 Project: Nutch Issue Type: Task Components: build Affects Versions: 1.4, 2.0 Reporter: Lewis John McGibbney

[jira] [Created] (NUTCH-1066) trivial correction of

2011-07-22 Thread Lewis John McGibbney (JIRA)
: Lewis John McGibbney Assignee: Lewis John McGibbney Fix For: 1.4, 2.0 Trivial spelling correction in domain-urlfilter.txt in both trunk and branch-1.4 The attached patches simply correct this. -- This message is automatically generated by JIRA. For more information on JIRA

[jira] [Updated] (NUTCH-1066) trivial correction of domain-urlfilter.txt

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1066: Summary: trivial correction of domain-urlfilter.txt (was: trivial correction

[jira] [Updated] (NUTCH-1066) trivial correction of

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1066: Attachment: NUTCH-1066-domain-urlfilter-trivial-branch.patch NUTCH

[jira] [Resolved] (NUTCH-1066) trivial correction of domain-urlfilter.txt

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1066. - Resolution: Fixed trivial correction of domain-urlfilter.txt

Correct Nutch tutorial

2011-07-28 Thread lewis john mcgibbney
Hi, Just been catching up with work over the last few days and noticed that there seems to be some confusion (to newer users) regarding the current Nutch tutorial. The link we have on the site points to [1], whilst many people still seem to somehow navigate their way to here [2]. I am positive

[jira] [Commented] (NUTCH-914) Implement Apache Project Branding Requirements

2011-07-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072417#comment-13072417 ] Lewis John McGibbney commented on NUTCH-914: How are we doing with this? As far

[jira] [Commented] (NUTCH-917) Website Navigation Links

2011-08-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13076174#comment-13076174 ] Lewis John McGibbney commented on NUTCH-917: Committed @ revision 1153108. I

[jira] [Commented] (NUTCH-208) http: proxy exception list:

2011-08-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13076212#comment-13076212 ] Lewis John McGibbney commented on NUTCH-208: This is an interesting

Re: Nutch 2 and Cassandra

2011-08-02 Thread lewis john mcgibbney
Hi I've been watching progress on this thread with interest and think that this would be a great addition to the wiki under the following page [1] I am happy to write it up, however is there anything else we need to be aware of in addition to the material you have provided, for example some

[jira] [Commented] (NUTCH-1049) Add classes to bin/nutch

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079308#comment-13079308 ] Lewis John McGibbney commented on NUTCH-1049: - I'm glad to see that there have

[jira] [Closed] (NUTCH-1065) New mvn.template

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1065. --- New mvn.template Key: NUTCH-1065

[jira] [Assigned] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1056: --- Assignee: Lewis John McGibbney Write a new plugin example for inclusion

[jira] [Commented] (NUTCH-431) Move plugin specific properties out of nutch-site.xml and into specific conf files for plugins

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079318#comment-13079318 ] Lewis John McGibbney commented on NUTCH-431: Can this issue be closed

Nutch 2.0 Documentation

2011-08-04 Thread lewis john mcgibbney
Hi, Was mucking around on a totally separate personal issue with Gora today and couldn't help but like the /docs directory which is bundled when you svn co the project. I would really like to push to get this going as per [1] as I have been trying to get various documentation updated over the

[jira] [Commented] (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080443#comment-13080443 ] Lewis John McGibbney commented on NUTCH-666: Chris excuse my naivety but I am

[jira] [Updated] (NUTCH-1035) Tune Solr config for Nutch users

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1035: Attachment: solrconfig.xml Attached solrconfig.xml to get the ball rolling

[jira] [Commented] (NUTCH-717) Make Nutch Solr integration easier

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080461#comment-13080461 ] Lewis John McGibbney commented on NUTCH-717: Are we to provide any support

[jira] [Commented] (NUTCH-713) Config options for webgraph Scoring not documented

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080571#comment-13080571 ] Lewis John McGibbney commented on NUTCH-713: Is it deemed necessary to add

[jira] [Commented] (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080572#comment-13080572 ] Lewis John McGibbney commented on NUTCH-342: What is the current status

[jira] [Assigned] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-208: -- Assignee: Lewis John McGibbney http: proxy exception list

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Attachment: NUTCH-208-branch-1.4-20110807.patch Attached patch to be tested on branch

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Priority: Trivial (was: Minor) Patch Info: [Patch Available

[jira] [Assigned] (NUTCH-881) Good quality documentation for Nutch

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-881: -- Assignee: Lewis John McGibbney Good quality documentation for Nutch

[jira] [Commented] (NUTCH-881) Good quality documentation for Nutch

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081639#comment-13081639 ] Lewis John McGibbney commented on NUTCH-881: In Nutch trunk we currently only

[jira] [Assigned] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-623: -- Assignee: Lewis John McGibbney Change plugin source directory

[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081642#comment-13081642 ] Lewis John McGibbney commented on NUTCH-623: On second thoughts, and taking

[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081677#comment-13081677 ] Lewis John McGibbney commented on NUTCH-623: If we wished to fix

[jira] [Commented] (NUTCH-463) Nutch powerpoint parser plugin fails to parse ppt with images

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081695#comment-13081695 ] Lewis John McGibbney commented on NUTCH-463: Can we close this issue? .ppt

[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081703#comment-13081703 ] Lewis John McGibbney commented on NUTCH-978: If there has been a plugin written

[jira] [Commented] (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081708#comment-13081708 ] Lewis John McGibbney commented on NUTCH-342: OK well I think that sets
