[ http://issues.apache.org/jira/browse/NUTCH-65?page=all ] Andrzej Bialecki closed NUTCH-65: ----------------------------------
Resolution: Fixed Patches applied. Thanks! > index-more plugin can't parse large set of modification-date > ------------------------------------------------------------- > > Key: NUTCH-65 > URL: http://issues.apache.org/jira/browse/NUTCH-65 > Project: Nutch > Type: Bug > Components: indexer > Environment: nutch 0.7, java 1.5, linux > Reporter: Lutischán Ferenc > > I found a problem in MoreIndexingFilter.java. > When I indexing segments, I get large list of error messages: > can't parse errorenous date: Wed, 10 Sep 2003 11:59:14 or > can't parse errorenous date: Wed, 10 Sep 2003 11:59:14GMT > I modifiing source code (I don't make a 'patch'): > Original (lines 137-138): > DateFormat df = new SimpleDateFormat("EEE MMM dd HH:mm:ss yyyy zzz"); > Date d = df.parse(date); > New: > DateFormat df = new SimpleDateFormat("EEE, MMM dd HH:mm:ss yyyy", Locale.US); > Date d = df.parse(date.substring(0,25)); > The modified code works fine. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira