Hi Rene, The feeds you are having problems with have content URLs that have no extension. I bet that your ElasticSearch file extensions are not set to include this. There's a special value "." that maps to URLs with no extension.
Karl On Thu, Aug 21, 2014 at 5:54 PM, Rene Nederhand <[email protected]> wrote: > Hi everyone, > > I have a fresh installation of ManifoldCF 1.6.1 (from binary > distribution) and trying to index RSS feeds in Elasticsearch (1.3.2). > > So, I created a job that ingests several feeds. However, it seems some > feeds are parsed, but items won't end in the index. It fetches the > items, but does not send these to ElasticSearch (ES). When I replace > ES as output connector to a file based output connector, the items > _are_ stored as HTML files as they should be. > > Examples: > > Working: > http://www.nu.nl/feeds/rss/algemeen.rss > > Not working: > http://www.hrpraktijk.nl/nieuws/feed > http://www.penoactueel.nl/RSS/Feed/Laatste-nieuws-van-POactueel/ > > What could be the reason for this behaviour? Is there a solution? > Should I create a ticket? > > Thanks, > Rene >
