[ http://issues.apache.org/jira/browse/NUTCH-65?page=all ]
Michael Nebel updated NUTCH-65:
-------------------------------
Attachment: MoreIndexingFilter.diff
commons-lang-2.1.jar
MoreIndexingFilter.java
As Jerome suggested, I changed the function getTime() to use the DateUtils from
commons-lang .
> index-more plugin can't parse large set of modification-date
> -------------------------------------------------------------
>
> Key: NUTCH-65
> URL: http://issues.apache.org/jira/browse/NUTCH-65
> Project: Nutch
> Type: Bug
> Components: indexer
> Versions: 0.7, 0.8-dev
> Environment: nutch 0.7, java 1.5, linux
> Reporter: Lutischán Ferenc
> Fix For: 0.8-dev
> Attachments: MoreIndexingFilter.diff, MoreIndexingFilter.java,
> commons-lang-2.1.jar
>
> I found a problem in MoreIndexingFilter.java.
> When I indexing segments, I get large list of error messages:
> can't parse errorenous date: Wed, 10 Sep 2003 11:59:14 or
> can't parse errorenous date: Wed, 10 Sep 2003 11:59:14GMT
> I modifiing source code (I don't make a 'patch'):
> Original (lines 137-138):
> DateFormat df = new SimpleDateFormat("EEE MMM dd HH:mm:ss yyyy zzz");
> Date d = df.parse(date);
> New:
> DateFormat df = new SimpleDateFormat("EEE, MMM dd HH:mm:ss yyyy", Locale.US);
> Date d = df.parse(date.substring(0,25));
> The modified code works fine.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers