Handling non-ingested documents in a repository connector needs to be made
consistent and easy-to-understand
------------------------------------------------------------------------------------------------------------
Key: CONNECTORS-216
URL: https://issues.apache.org/jira/browse/CONNECTORS-216
Project: ManifoldCF
Issue Type: Improvement
Components: Framework crawler agent
Affects Versions: ManifoldCF 0.2, ManifoldCF 0.1, ManifoldCF 0.3
Reporter: Karl Wright
The way a connector writer is supposed to handle documents that should not be
indexed is not clear. The IProcessActivity interface provides a method,
deleteDocument(id), which removes the document from the index. But it does not
record the version string when it does this, making that method only really
useful for the case where the version string is empty. For cases where the
version string is non-empty, you are supposed to use
ingestDocument(id,version,null,null), which works fine to record the version
string and allow for change detection, but is very non-obvious to the user.
Suggestion is to add a new variant of deleteDocument(), which takes both an id
and version, and decides automatically whether to use ingestDocument() or
deleteDocument() based on the contents of the version string. Also, the old
deleteDocument(id) variant should be deprecated.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira