[ https://issues.apache.org/jira/browse/NUTCH-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15864429#comment-15864429 ]
ASF GitHub Bot commented on NUTCH-2359:
---------------------------------------
GitHub user laknath opened a pull request:
https://github.com/apache/nutch/pull/178
NUTCH-2359 RegexParseFilter: ill-formed rules raise error
- README
- Using any whitespace character (\s) instead of tab (\t) as the rule delimiter
Issue: https://issues.apache.org/jira/browse/NUTCH-2359
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/laknath/nutch NUTCH-2359
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/178.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #178
----
commit 70bc1dac14c254d090927d2ce8d67bb9747bb8d0
Author: Laknath <[email protected]>
Date: 2017-02-13T01:48:46Z
NUTCH-2359 RegexParseFilter: ill-formed rules raise error
- README
- Using any whitespace character (\s) instead of tab (\t) as the rule delimiter
----
> Parsefilter-regex raises IndexOutOfBoundsException when rules are ill-formed
> ----------------------------------------------------------------------------
>
> Key: NUTCH-2359
> URL: https://issues.apache.org/jira/browse/NUTCH-2359
> Project: Nutch
> Issue Type: Bug
> Components: plugin
> Affects Versions: 1.12
> Reporter: Laknath Semage
> Priority: Minor
> Labels: patch
> Fix For: 1.13
>
>
> This patch fixes:
> 1) [Bug] Parsefilter-regex raises IndexOutOfBoundsException when rules are
> ill-formed
> 2) Rules are split on any whitespace character (\s) instead of a tab (\t)
> 3) A detailed Readme for the plugin
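
The first two points can be illustrated with a minimal sketch. This is not the code from the pull request: the class name RuleLineSketch, the Rule holder, the three-field layout (field name, source, regex) and the handling of '#' comment lines are all assumptions made for illustration. The idea is to split each rule line on runs of whitespace and to check the number of fields before indexing into the result, so an ill-formed rule yields a descriptive error rather than an IndexOutOfBoundsException.

```java
import java.util.Optional;
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

public class RuleLineSketch {

    /** Hypothetical holder for one parsed rule; not the plugin's actual class. */
    static final class Rule {
        final String field;
        final String source;
        final Pattern pattern;

        Rule(String field, String source, Pattern pattern) {
            this.field = field;
            this.source = source;
            this.pattern = pattern;
        }
    }

    /**
     * Parses one rule line. Assumes three fields: field name, source, regex.
     * Returns an empty Optional for blank lines and for lines starting with '#'
     * (the comment convention is an assumption).
     */
    static Optional<Rule> parseLine(String line) {
        String trimmed = line.trim();
        if (trimmed.isEmpty() || trimmed.startsWith("#")) {
            return Optional.empty();
        }
        // Split on any run of whitespace (\s+), not only on a tab.
        // The limit of 3 keeps whitespace inside the regex field intact.
        String[] parts = trimmed.split("\\s+", 3);
        // Validate the field count before indexing, so a short line
        // cannot trigger an index-out-of-bounds error.
        if (parts.length < 3) {
            throw new IllegalArgumentException(
                "Ill-formed rule, expected 3 fields but found " + parts.length + ": " + line);
        }
        try {
            return Optional.of(new Rule(parts[0], parts[1], Pattern.compile(parts[2])));
        } catch (PatternSyntaxException e) {
            throw new IllegalArgumentException("Invalid regex in rule: " + line, e);
        }
    }

    public static void main(String[] args) {
        // A tab-delimited and a space-delimited rule parse the same way.
        Rule tabbed = parseLine("title\thtml\t<title>(.*)</title>").get();
        Rule spaced = parseLine("title html <title>(.*)</title>").get();
        System.out.println(tabbed.field + " / " + spaced.pattern.pattern());

        // A rule with a missing field is rejected with a clear message.
        try {
            parseLine("title html");
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Passing a limit of 3 to split is a design choice in this sketch: it leaves the remainder of the line untouched as the regex field, so a pattern that itself contains spaces is not broken apart.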