Author: snagel
Date: Fri Jan 22 21:26:12 2016
New Revision: 1726314

URL: http://svn.apache.org/viewvc?rev=1726314&view=rev
Log:
NUTCH-2204 Remove junit lib from runtime

Modified:
    nutch/trunk/CHANGES.txt
    nutch/trunk/conf/regex-normalize.xml.template
    nutch/trunk/ivy/ivy.xml

Modified: nutch/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/nutch/trunk/CHANGES.txt?rev=1726314&r1=1726313&r2=1726314&view=diff
==============================================================================
--- nutch/trunk/CHANGES.txt (original)
+++ nutch/trunk/CHANGES.txt Fri Jan 22 21:26:12 2016
@@ -1,5 +1,7 @@
 Nutch Change Log
 
+* NUTCH-2204 Remove junit lib from runtime (snagel)
+
 * NUTCH-2201 Remove loops program from webgraph package (markus)
 
 * NUTCH-1325 HostDB for Nutch (Gui Forget, markus, tejasp)

Modified: nutch/trunk/conf/regex-normalize.xml.template
URL: http://svn.apache.org/viewvc/nutch/trunk/conf/regex-normalize.xml.template?rev=1726314&r1=1726313&r2=1726314&view=diff
==============================================================================
--- nutch/trunk/conf/regex-normalize.xml.template (original)
+++ nutch/trunk/conf/regex-normalize.xml.template Fri Jan 22 21:26:12 2016
@@ -39,11 +39,12 @@
   <substitution>/$3</substitution>
 </regex> -->
 
-<!-- removes interpage href anchors such as site.com#location -->
+<!-- removes interpage href anchors such as site.com#location
+     (also done by urlnormalizer-basic
 <regex>
-  <pattern>#.*?(\?|&|$)</pattern>
+  <pattern>#.*</pattern>
   <substitution>$1</substitution>
-</regex>
+</regex> -->
 
 <!-- cleans ?&var=value into ?var=value -->
 <regex>

Modified: nutch/trunk/ivy/ivy.xml
URL: http://svn.apache.org/viewvc/nutch/trunk/ivy/ivy.xml?rev=1726314&r1=1726313&r2=1726314&view=diff
==============================================================================
--- nutch/trunk/ivy/ivy.xml (original)
+++ nutch/trunk/ivy/ivy.xml Fri Jan 22 21:26:12 2016
@@ -92,6 +92,7 @@
         <dependency org="org.netpreserve.commons" name="webarchive-commons" rev="1.1.5" conf="*->default">
             <exclude module="hadoop-core"/>
             <exclude org="com.google.guava"/>
+            <exclude org="junit"/>
         </dependency>
 
         <!--artifacts needed for testing -->
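
For readers comparing the two anchor patterns in the regex-normalize.xml.template hunk above, here is a minimal sketch of how they differ on sample URLs. It is written in Python for brevity and is not Nutch code; the function names and sample URLs are hypothetical. It assumes the new pattern is paired with an empty replacement, because `#.*` has no capture group for `$1` to refer to (the rule is commented out in the template, so this does not matter there).

```python
import re

# Patterns copied from the diff: the removed one and its replacement.
OLD_PATTERN = re.compile(r"#.*?(\?|&|$)")
NEW_PATTERN = re.compile(r"#.*")


def strip_anchor_old(url: str) -> str:
    """Hypothetical helper: apply the removed pattern with substitution $1."""
    # The lazy match stops at the first '?' or '&' after '#', or at the end
    # of the string; group 1 puts that delimiter back.
    return OLD_PATTERN.sub(r"\1", url)


def strip_anchor_new(url: str) -> str:
    """Hypothetical helper: apply the new pattern."""
    # The greedy match drops everything from '#' to the end of the URL.
    # Assumption: empty replacement, since this pattern has no group 1.
    return NEW_PATTERN.sub("", url)


if __name__ == "__main__":
    samples = (
        "http://site.com/page#location",
        "http://site.com/page#location?x=1",
    )
    for url in samples:
        print(url, strip_anchor_old(url), strip_anchor_new(url), sep="\n  ")
```

The two patterns agree on a plain `#location` suffix. They differ when a `?` or `&` follows the anchor: the old pattern keeps the trailing text, the new one removes it along with the fragment.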