Consider adding SiteMap support to RSS connector
------------------------------------------------
Key: CONNECTORS-255
URL: https://issues.apache.org/jira/browse/CONNECTORS-255
Project: ManifoldCF
Issue Type: New Feature
Components: RSS connector, Web connector
Affects Versions: ManifoldCF 0.4
Reporter: Karl Wright
Assignee: Karl Wright
Fix For: ManifoldCF 0.4
The RSS connector seems well suited for parsing sitemap XML and just doing the
right thing. I'd propose adding the ability to parse sitemap XML in addition
to parsing RSS and Atom. I would not go so far as automatically picking up the
SiteMap field from robots.txt yet, however, since that would require another
level of indirection that would need to be thought out. A direct reference in
the "RSS URLs" field to the root sitemap URL would be where I'd start.
The Web connector should, of course, also get the ability to do this parsing,
as it has for RSS feeds.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira