Hi,

according to the error message the RSS feed abc.xml
is truncated or invalid. Could you check the property
"http.content.limit", default is 64kB, large RSS feeds
may get truncated.

You can test parsing itself by

% bin/nutch parsechecker 'http://www.###.###.##/###/abc.xml'

Cheers,
Sebastian


On 10/03/2013 08:15 PM, A Laxmi wrote:
> I got this error and researching on it doesn't seem to help much.  Please
> help.
> 
> *Error tika.TikaParser - Error parsing http://www.###.###.##/###/abc.xml
> 
> *
> *org.apache.tika.exception.TikaException: RSS parse error
> 
> *
> *Caused by com.sun.syndication.io.ParsingFeedException: Invalid XML: Error
> on line 436: The element "item" must be terminated by the matching end-tag
> "</item>"*
> 

Reply via email to