Hi Karl, Thanks a lot! You've saved my day.
This was exactly the reason the feeds didn't work. Glad it was that simple. Even if it took me hours to find it was just a dot to add at the correct place :) Cheers, Rene On Fri, Aug 22, 2014 at 1:28 AM, Karl Wright <[email protected]> wrote: > Hi Rene, > > The feeds you are having problems with have content URLs that have no > extension. I bet that your ElasticSearch file extensions are not set to > include this. There's a special value "." that maps to URLs with no > extension. > > Karl > > > > On Thu, Aug 21, 2014 at 5:54 PM, Rene Nederhand <[email protected]> wrote: >> >> Hi everyone, >> >> I have a fresh installation of ManifoldCF 1.6.1 (from binary >> distribution) and trying to index RSS feeds in Elasticsearch (1.3.2). >> >> So, I created a job that ingests several feeds. However, it seems some >> feeds are parsed, but items won't end in the index. It fetches the >> items, but does not send these to ElasticSearch (ES). When I replace >> ES as output connector to a file based output connector, the items >> _are_ stored as HTML files as they should be. >> >> Examples: >> >> Working: >> http://www.nu.nl/feeds/rss/algemeen.rss >> >> Not working: >> http://www.hrpraktijk.nl/nieuws/feed >> http://www.penoactueel.nl/RSS/Feed/Laatste-nieuws-van-POactueel/ >> >> What could be the reason for this behaviour? Is there a solution? >> Should I create a ticket? >> >> Thanks, >> Rene > >
