Handling non-ingested documents in a repository connector needs to be made 
consistent and easy-to-understand
------------------------------------------------------------------------------------------------------------

                 Key: CONNECTORS-216
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-216
             Project: ManifoldCF
          Issue Type: Improvement
          Components: Framework crawler agent
    Affects Versions: ManifoldCF 0.2, ManifoldCF 0.1, ManifoldCF 0.3
            Reporter: Karl Wright


The way a connector writer is supposed to handle documents that should not be 
indexed is not clear.  The IProcessActivity interface provides a method, 
deleteDocument(id), which removes the document from the index.  But it does not 
record the version string when it does this, making that method only really 
useful for the case where the version string is empty.  For cases where the 
version string is non-empty, you are supposed to use 
ingestDocument(id,version,null,null), which works fine to record the version 
string and allow for change detection, but is very non-obvious to the user.

Suggestion is to add a new variant of deleteDocument(), which takes both an id 
and version, and decides automatically whether to use ingestDocument() or 
deleteDocument() based on the contents of the version string.  Also, the old 
deleteDocument(id) variant should be deprecated.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to