GitHub user roberttjahjadi opened a pull request:
https://github.com/apache/nutch/pull/75
Trunk
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/apache/nutch trunk
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/75.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #75
----
commit d8fd479cb7f967bb412c71ec0736aa43f1dcad62
Author: Julien Nioche <[email protected]>
Date: 2014-06-13T11:17:26Z
NUTCH-1647 protocol-http throws 'unzipBestEffort returned null' for
redirected pages (jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1602375
13f79535-47bb-0310-9956-ffa450edef68
commit 67e29cf34cf36cb9639ac83d08251dba9251f64e
Author: Julien Nioche <[email protected]>
Date: 2014-06-17T08:41:57Z
NUTCH-1793 HttpRobotRulesParser not configured properly (jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1603094
13f79535-47bb-0310-9956-ffa450edef68
commit 827e6762369e12850e76ae5f344c24052079a7a5
Author: Julien Nioche <[email protected]>
Date: 2014-06-17T14:11:23Z
NUTCH-1590 [SECURITY] Frame injection vulnerability in published Javadoc
(jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1603179
13f79535-47bb-0310-9956-ffa450edef68
commit 87af23b1ba4bbd003fe4bc56687023e3baaf373a
Author: Markus Jelsma <[email protected]>
Date: 2014-06-17T14:23:49Z
NUTCH-1794 IndexingFilterChecker to optionally dumpText
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1603185
13f79535-47bb-0310-9956-ffa450edef68
commit c9a91f527cac841f5e9355e0d8aac67b2aa3cd6a
Author: Sebastian Nagel <[email protected]>
Date: 2014-06-20T22:15:43Z
NUTCH-1718 redefine http.robots.agent as "additional agent names"
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1604291
13f79535-47bb-0310-9956-ffa450edef68
commit 02fda7de9b7d27c65a8ff8f14fbbf769623fc3e2
Author: Sebastian Nagel <[email protected]>
Date: 2014-06-20T22:56:32Z
NUTCH-1767 remove special treatment of "params" in relative links
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1604298
13f79535-47bb-0310-9956-ffa450edef68
commit 32be5a6c69b9fe41127ad17c752e446e6b61a291
Author: Sebastian Nagel <[email protected]>
Date: 2014-06-24T21:41:28Z
NUTCH-1787 update and complete API doc overview page
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1605204
13f79535-47bb-0310-9956-ffa450edef68
commit c74c89393bf86090e4da162786be54188483f977
Author: Julien Nioche <[email protected]>
Date: 2014-06-25T11:01:38Z
NUTCH-1633 slf4j is provided by hadoop and should not be included in the
job file (kaveh minooie via jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1605331
13f79535-47bb-0310-9956-ffa450edef68
commit 5b724bfcd4a8eb7cba9bb6427ccb47efd851482e
Author: Julien Nioche <[email protected]>
Date: 2014-06-27T07:38:45Z
NUTCH-385 Improve description of thread related configuration for Fetcher
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1605978
13f79535-47bb-0310-9956-ffa450edef68
commit 4106ad092ad34bba3c4a31837d71042a3dfc71ce
Author: Julien Nioche <[email protected]>
Date: 2014-06-30T12:38:58Z
NUTCH-1803 Put test dependencies in a separate lib dir
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1606715
13f79535-47bb-0310-9956-ffa450edef68
commit 376cc5f66161b9533bf0e82537f4c5b9a0195979
Author: Julien Nioche <[email protected]>
Date: 2014-06-30T13:40:06Z
NUTCH-1802 Move TestbedProxy to test environment (jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1606730
13f79535-47bb-0310-9956-ffa450edef68
commit b2e2fe5b6087fd0a5c58cb9d50ec6d362c985747
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-04T20:15:12Z
add dependency "init" (calling "ivy-init") to "compile-core-test" to fix
nightly build failures introduced with NUTCH-1803
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1607929
13f79535-47bb-0310-9956-ffa450edef68
commit 9cc0f7b47d69bd9f0c4fc8d821d491d61f17479a
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-05T20:36:33Z
NUTCH-1605 MIME type detector recognizes xlsx as zip file
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1608130
13f79535-47bb-0310-9956-ffa450edef68
commit 282af5ef4fc3eda5d603cd8500a431eadb8d5eea
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-05T21:13:19Z
NUTCH-1566 bin/nutch to allow whitespace in paths
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1608135
13f79535-47bb-0310-9956-ffa450edef68
commit 248e317afd4f38569f60ccfad1387b19f621c837
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-05T21:42:20Z
NUTCH-1776 Log incorrect plugin.folder file path
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1608136
13f79535-47bb-0310-9956-ffa450edef68
commit 20b24c2f12500e2ea8b09e4d3bb3d5044cab89f1
Author: Julien Nioche <[email protected]>
Date: 2014-07-07T12:38:23Z
NUTCH-578 URL fetched with 403 is generated over and over again
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1608431
13f79535-47bb-0310-9956-ffa450edef68
commit a2334386e7c1e78baa078b139624ac1c1d19ce20
Author: Julien Nioche <[email protected]>
Date: 2014-07-09T14:01:20Z
NUTCH-1799 ANT Eclipse task discovers all plugin jars automatically
(jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1609158
13f79535-47bb-0310-9956-ffa450edef68
commit 1d73b5a2b741dbd937bae94bc2f22f0e47173856
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-10T20:50:27Z
NUTCH-1811 bin/nutch junit to use junit 4 test runner
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1609568
13f79535-47bb-0310-9956-ffa450edef68
commit c23d27e429f2844369a77988289547f35a68fa31
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T08:39:16Z
NUTCH-1804 Move JUnit dependency to test scope
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610624
13f79535-47bb-0310-9956-ffa450edef68
commit 08476fccd8e88c7744be963f58323daa1fd66649
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T09:03:24Z
Eclipse task gets test dependencies after NUTCH-1803
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610627
13f79535-47bb-0310-9956-ffa450edef68
commit bdcbfa5b29ff669bce6792778af60b812dd23dfd
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T09:16:47Z
NUTCH-1502 Test for CrawlDatum state transitions (snagel)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610628
13f79535-47bb-0310-9956-ffa450edef68
commit 1b46ce6cad851eef37d33eb290a4aac49449f84c
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T09:34:38Z
NUTCH-1422 Bypass signature comparison when a document is redirected
(snagel)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610631
13f79535-47bb-0310-9956-ffa450edef68
commit 91cc99828a0c6c5d1ee7c9ee9ae321fb439386a3
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T10:18:49Z
build: resolve-test calls init task so that the ivy jar gets imported
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610635
13f79535-47bb-0310-9956-ffa450edef68
commit 914c5be48bc829d3667abb0f9cf776b1e1f661fb
Author: Julien Nioche <[email protected]>
Date: 2014-07-15T11:32:32Z
NUTCH-926 Redirections from META tag don't get filtered
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610659
13f79535-47bb-0310-9956-ffa450edef68
commit e463fbc1a6293307faf6a9744023ab3717134f05
Author: Julien Nioche <[email protected]>
Date: 2014-07-16T10:11:01Z
NUTCH-1817 Remove pom.xml from source (jnioche)
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610956
13f79535-47bb-0310-9956-ffa450edef68
commit 461dd9d52cb3a7bd482f200d0267114eb86bc06b
Author: Julien Nioche <[email protected]>
Date: 2014-07-17T09:17:38Z
Wrong task called in deps-jar for urlfilter-* plugins prevents ant runtime
from working
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1611303
13f79535-47bb-0310-9956-ffa450edef68
commit f062d2b540b6b4a251bdac3cd73a57b8c0b2fc39
Author: Julien Nioche <[email protected]>
Date: 2014-07-17T12:42:01Z
NUTCH-1818 Add deps-test-compile task for building plugins
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1611343
13f79535-47bb-0310-9956-ffa450edef68
commit bad0a2076a8c724a0542b923ac10bb812c0de644
Author: Sebastian Nagel <[email protected]>
Date: 2014-07-29T15:13:20Z
NUTCH-1708 use same id when indexing and deleting redirects
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1614375
13f79535-47bb-0310-9956-ffa450edef68
commit 8c6c916ba63fda36d4dfe40a266af3dc4553465b
Author: Julien Nioche <[email protected]>
Date: 2014-07-30T08:55:24Z
NUTCH-1561 improve usability of parse-metatags and index-metadata
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1614586
13f79535-47bb-0310-9956-ffa450edef68
commit 18714f89fc7b809708e8a8ed51a432516577e59c
Author: Lewis John McGibbney <[email protected]>
Date: 2014-08-18T20:40:06Z
Create new development versions in Trunk
git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1618725
13f79535-47bb-0310-9956-ffa450edef68
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and would like it, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---