svn commit: r1499684 - in /nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/Injector.java

2013-07-04 Thread markus
Author: markus Date: Thu Jul 4 08:50:25 2013 New Revision: 1499684 URL: http://svn.apache.org/r1499684 Log: NUTCH-1600 Injector overwrite does not always work properly Modified: nutch/trunk/CHANGES.txt nutch/trunk/src/java/org/apache/nutch/crawl/Injector.java Modified:

svn commit: r1499696 - in /nutch/trunk: CHANGES.txt src/plugin/headings/src/java/org/apache/nutch/parse/headings/HeadingsParseFilter.java

2013-07-04 Thread markus
Author: markus Date: Thu Jul 4 09:07:12 2013 New Revision: 1499696 URL: http://svn.apache.org/r1499696 Log: NUTCH-1597 HeadingsParseFilter to trim and remove exess whitespace Modified: nutch/trunk/CHANGES.txt

svn commit: r1499722 - in /nutch/trunk: CHANGES.txt src/plugin/headings/src/java/org/apache/nutch/parse/headings/HeadingsParseFilter.java

2013-07-04 Thread markus
Author: markus Date: Thu Jul 4 11:13:34 2013 New Revision: 1499722 URL: http://svn.apache.org/r1499722 Log: NUTCH-1596 HeadingsParseFilter not thread safe Modified: nutch/trunk/CHANGES.txt nutch/trunk/src/plugin/headings/src/java/org/apache/nutch/parse/headings/HeadingsParseFilter.java

svn commit: r1499779 - in /nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/CrawlDatum.java

2013-07-04 Thread fenglu
Author: fenglu Date: Thu Jul 4 15:07:13 2013 New Revision: 1499779 URL: http://svn.apache.org/r1499779 Log: NUTCH-1602 improve the readability of metadata in readdb dump normal Modified: nutch/trunk/CHANGES.txt nutch/trunk/src/java/org/apache/nutch/crawl/CrawlDatum.java Modified: