Missed this in the console:
[Fatal Error] :115:120: The element type "HR" must be terminated by the matching end-tag "</HR>".
org.apache.lcf.core.interfaces.LCFException: XML parsing error: The element type "HR" must be terminated by the matching end-tag "</HR>".
at org.apache.lcf.core.common.XMLDoc.init(XMLDoc.java:369)
at org.apache.lcf.core.common.XMLDoc.<init>(XMLDoc.java:317)
at org.apache.lcf.agents.output.solr.HttpPoster.getResponse(HttpPoster.java:537)
So there's a parsing error for the XML.
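One plausible reading, which is an assumption since the thread never shows the response body itself: the 500 response carried an HTML error page rather than XML, and HTML allows a bare <HR> where XML does not. Here is a minimal sketch that reproduces the same kind of parser complaint with the JDK's own XML parser. The class name HrParseDemo, the helper parseError, and both sample strings are made up for illustration; none of this is LCF's XMLDoc code.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;

public class HrParseDemo {
    // Hypothetical helper: returns the parser's error message,
    // or null if the text is well-formed XML.
    public static String parseError(String text) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            builder.parse(new InputSource(new StringReader(text)));
            return null;
        } catch (SAXParseException e) {
            // Well-formedness violations such as an unclosed element land here.
            return e.getMessage();
        } catch (Exception e) {
            // Parser configuration or I/O problems.
            return e.toString();
        }
    }

    public static void main(String[] args) {
        // Invented stand-in for an HTML error page: <HR> is never closed.
        String htmlErrorPage =
            "<html><body><h2>HTTP ERROR 500</h2><HR><p>error page</p></body></html>";
        // Invented stand-in for a well-formed XML response.
        String xmlResponse =
            "<response><int name=\"status\">0</int></response>";

        System.out.println("HTML page: " + parseError(htmlErrorPage));
        System.out.println("XML reply: " + parseError(xmlResponse));
    }
}
```

If that reading is right, the parse error is only a symptom, and the real cause would be whatever made the server answer with a 500 in the first place.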
From: Jens Bengtsson [mailto:[email protected]]
Sent: den 21 juli 2010 15:36
To: [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
So I updated and this is the error I get in the log now:
Service interruption reported for job 1279719088042 connection 'Giantbomb RSS':
Error 500 from ingestion request; ingestion will be retried again later
From: Jens Bengtsson [mailto:[email protected]]
Sent: den 21 juli 2010 14:41
To: [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
No worries!
I'm very thankful for your help.
Jens
From: [email protected] [mailto:[email protected]]
Sent: den 21 juli 2010 14:34
To: [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
Yesterday should be fine. I overlooked something and have checked in a fix.
My apologies.
Karl
From: ext Jens Bengtsson [mailto:[email protected]]
Sent: Wednesday, July 21, 2010 8:26 AM
To: [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
That's strange, because I did the checkout from
https://svn.apache.org/repos/asf/incubator/lcf/trunk yesterday, and today I did an
update and rebuilt everything, so things should be in sync with trunk.
Jens
From: [email protected] [mailto:[email protected]]
Sent: den 21 juli 2010 13:55
To: [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
Hi Jens,
The trace you gave me is out of date with respect to trunk by at least a month. Would you
be willing to sync up to the latest LCF and see how you do with that? If you
still see a trace, I'd be happy to analyze it and perhaps check in a patch.
Karl
From: Wright Karl (Nokia-MS/Cambridge)
Sent: Wednesday, July 21, 2010 6:51 AM
To: [email protected]; [email protected]
Subject: RE: java.lang.NullPointerException while trying to crawl RSS feed to
Solr
The 'connection working' from RSS doesn't mean much. But the 'connection
working' from Solr means that LCF could talk to Solr and do a ping.
In any case, you should never see an NPE from LCF, so I am going to look into
this at the earliest opportunity. It is possible that the NPE is masking some
other error, but maybe it is just broken.
Karl
--- original message ---
From: "ext Jens Bengtsson" <[email protected]>
Subject: java.lang.NullPointerException while trying to crawl RSS feed to Solr
Date: July 21, 2010
Time: 6:38:7 AM
Hi!
I have set up a connector against an RSS feed with output to a Solr server. The
repository connection and output connection both report that the connection is ok.
When I run the job it seems to retrieve the RSS feed and process everything as
it should. However, the data does not seem to get indexed into Solr.
If I look in the lcf log file I find the following:
Error tossed: null
java.lang.NullPointerException
at org.apache.lcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:976)
I guess there's an error when it tries to post the data to Solr, but I can't
figure out what the problem is. If I look at the catalina log for the Tomcat
instance where Solr runs, I can't find any errors or anything else relevant.
Does anyone have any tips?