[jira] [Commented] (NUTCH-2263) Support for mingram and maxgram at Unigram Cosine Similarity Model

2016-05-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292542#comment-15292542 ] Hudson commented on NUTCH-2263: --- SUCCESS: Integrated in Nutch-trunk #3370 (See [https://bui

Jenkins build is back to normal : Nutch-trunk #3370

2016-05-19 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292529#comment-15292529 ] ASF GitHub Bot commented on NUTCH-2264: --- Github user lewismc commented on a diff in

[GitHub] nutch pull request: NUTCH-2264 Check Forbidden API's at Build

2016-05-19 Thread lewismc
Github user lewismc commented on a diff in the pull request: https://github.com/apache/nutch/pull/115#discussion_r63980262 --- Diff: build.xml --- @@ -1035,4 +1039,11 @@ + + + + --- End diff -- yes I think we sho

[jira] [Updated] (NUTCH-2263) Support for mingram and maxgram at Unigram Cosine Similarity Model

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2263: Fix Version/s: (was: 2.4.1) 1.12 > Support for mingram and ma

[jira] [Commented] (NUTCH-2263) Support for mingram and maxgram at Unigram Cosine Similarity Model

2016-05-19 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292526#comment-15292526 ] ASF GitHub Bot commented on NUTCH-2263: --- Github user asfgit closed the pull request

[jira] [Resolved] (NUTCH-2263) Support for mingram and maxgram at Unigram Cosine Similarity Model

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2263. - Resolution: Fixed Assignee: Furkan KAMACI Thank you [~kamaci] nice patch >

[GitHub] nutch pull request: NUTCH-2263 Support for mingram and maxgram at ...

2016-05-19 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/nutch/pull/114 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

[jira] [Updated] (NUTCH-2122) Implement Javadoc package-info.java for webui packages

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Furkan KAMACI updated NUTCH-2122: - Summary: Implement Javadoc package-info.java for webui packages (was: Implement Javadoc package-i

[jira] [Commented] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292473#comment-15292473 ] ASF GitHub Bot commented on NUTCH-2264: --- Github user kamaci commented on a diff in t

[GitHub] nutch pull request: NUTCH-2264 Check Forbidden API's at Build

2016-05-19 Thread kamaci
Github user kamaci commented on a diff in the pull request: https://github.com/apache/nutch/pull/115#discussion_r63977002 --- Diff: build.xml --- @@ -1035,4 +1039,11 @@ + + + + --- End diff -- Here is a list for

[jira] [Updated] (NUTCH-2122) Implement Javadoc package-info.html for webui packages

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2122: Summary: Implement Javadoc package-info.html for webui packages (was: Implement Jav

[jira] [Commented] (NUTCH-2122) Implement Javadoc package.html for webui packages

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292454#comment-15292454 ] Lewis John McGibbney commented on NUTCH-2122: - I agree :) > Implement Javadoc

[GitHub] nutch pull request: NUTCH-2264 Check Forbidden API's at Build

2016-05-19 Thread lewismc
Github user lewismc commented on a diff in the pull request: https://github.com/apache/nutch/pull/115#discussion_r63976670 --- Diff: build.xml --- @@ -1035,4 +1039,11 @@ + + + + --- End diff -- Are there no other

[jira] [Commented] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292453#comment-15292453 ] ASF GitHub Bot commented on NUTCH-2264: --- Github user lewismc commented on a diff in

[jira] [Commented] (NUTCH-1084) ReadDB url throws exception

2016-05-19 Thread kaveh minooie (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292406#comment-15292406 ] kaveh minooie commented on NUTCH-1084: -- has there been any update on this issue? I am

[jira] [Comment Edited] (NUTCH-2122) Implement Javadoc package.html for webui packages

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292388#comment-15292388 ] Furkan KAMACI edited comment on NUTCH-2122 at 5/19/16 11:59 PM:

[jira] [Commented] (NUTCH-2122) Implement Javadoc package.html for webui packages

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292388#comment-15292388 ] Furkan KAMACI commented on NUTCH-2122: -- [~lewismc] we may also prefer package-info.ja

[jira] [Commented] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292360#comment-15292360 ] Furkan KAMACI commented on NUTCH-2264: -- This is an initial PR due to I get that error

[GitHub] nutch pull request: NUTCH-2264 Check Forbidden API's at Build

2016-05-19 Thread kamaci
GitHub user kamaci opened a pull request: https://github.com/apache/nutch/pull/115 NUTCH-2264 Check Forbidden API's at Build Forbidden APIs is checked at ant build. You can merge this pull request into a Git repository by running: $ git pull https://github.com/kamaci/nutch NUTC

[jira] [Commented] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15292355#comment-15292355 ] ASF GitHub Bot commented on NUTCH-2264: --- GitHub user kamaci opened a pull request:

[jira] [Created] (NUTCH-2264) Check Forbidden API's at Build

2016-05-19 Thread Furkan KAMACI (JIRA)
Furkan KAMACI created NUTCH-2264: Summary: Check Forbidden API's at Build Key: NUTCH-2264 URL: https://issues.apache.org/jira/browse/NUTCH-2264 Project: Nutch Issue Type: Task Affects Ver

Re: Breaking Change Note in CHANGES.txt

2016-05-19 Thread Lewis John Mcgibbney
Thanks Markus I appreciate the response. I'll push the release candidate now. On Tue, May 17, 2016 at 9:46 AM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Folks, > What is going on with the note in CHANGES.txt? [0] I've pasted it below > for convenience. > Did I miss some convo

[jira] [Updated] (NUTCH-2122) Implement Javadoc package.html for webui packages

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Furkan KAMACI updated NUTCH-2122: - Summary: Implement Javadoc package.html for webui packages (was: Implement Javadoc package.html f

[jira] [Commented] (NUTCH-1858) Migrate Nutch documentation from Moin Moin to Confluence

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15291595#comment-15291595 ] Lewis John McGibbney commented on NUTCH-1858: - AFAIK a script or two exist to

[jira] [Commented] (NUTCH-1858) Migrate Nutch documentation from Moin Moin to Confluence

2016-05-19 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15291591#comment-15291591 ] Sebastian Nagel commented on NUTCH-1858: It's hardly a work for a single person. F

[jira] [Commented] (NUTCH-1858) Migrate Nutch documentation from Moin Moin to Confluence

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15291400#comment-15291400 ] Lewis John McGibbney commented on NUTCH-1858: - I honestly do no know. This is

[jira] [Commented] (NUTCH-2122) Implement Javadoc package.html for service packages

2016-05-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15291395#comment-15291395 ] Lewis John McGibbney commented on NUTCH-2122: - Hi Furkan, please change the 's

[Nutch Wiki] Trivial Update of "bin/nutch generate" by SebastianNagel

2016-05-19 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "bin/nutch generate" page has been changed by SebastianNagel: https://wiki.apache.org/nutch/bin/nutch%20generate?action=diff&rev1=6&rev2=7 Comment: fix format of last edit nutch-

[Nutch Wiki] Update of "bin/nutch generate" by SebastianNagel

2016-05-19 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "bin/nutch generate" page has been changed by SebastianNagel: https://wiki.apache.org/nutch/bin/nutch%20generate?action=diff&rev1=5&rev2=6 Comment: Add hint about number of reducers

[jira] [Commented] (NUTCH-2122) Implement Javadoc package.html for service packages

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15291035#comment-15291035 ] Furkan KAMACI commented on NUTCH-2122: -- [~lewismc] it seems that this package is remo

[Nutch Wiki] Update of "bin/nutch generate" by SebastianNagel

2016-05-19 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "bin/nutch generate" page has been changed by SebastianNagel: https://wiki.apache.org/nutch/bin/nutch%20generate?action=diff&rev1=4&rev2=5 Comment: Add information about scope (per s

[jira] [Commented] (NUTCH-1858) Migrate Nutch documentation from Moin Moin to Confluence

2016-05-19 Thread Furkan KAMACI (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15290986#comment-15290986 ] Furkan KAMACI commented on NUTCH-1858: -- [~lewismc] I can work on this issue. What sho

[jira] [Updated] (NUTCH-2164) Inconsistent 'Modified Time' in crawl db

2016-05-19 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2164: --- Fix Version/s: 1.13 > Inconsistent 'Modified Time' in crawl db > -

Re: A powerful Charset Encoding Detector plugin for Nutch

2016-05-19 Thread Sebastian Nagel
Hi Shabanali, thanks for your offer! And sorry for the late reply. Currently, in Nutch charset detection is not plugabble. Because encoding is an integral part of document formats It's a task for the parser because it's really tight to document formats, and does work really different, e.g. for HT