GitHub user jeremie70 opened a pull request:
https://github.com/apache/nutch/pull/92
Add the boilerpipe parsing adapted from NUTCH-961
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jeremie70/nutch my-branch
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/92.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #92
----
commit f185bc4461c57a1a85578de0ecf0884c7026c3a6
Author: Jérémie Bourseau <[email protected]>
Date: 2016-02-26T10:37:28Z
improve parser with boilerpipe
commit 93ea2e51f444447be41ec93b2c0b0b61c117eeb3
Author: Jérémie Bourseau <[email protected]>
Date: 2016-02-26T10:37:28Z
NUTCH-961 improve parser with boilerpipe
commit be91764fdf59d4f6930fc3211a84a252e5452674
Author: Jérémie Bourseau <[email protected]>
Date: 2016-02-26T11:00:36Z
Merge branch 'my-branch' of https://github.com/jeremie70/nutch into
my-branch
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---