[
https://issues.apache.org/jira/browse/CONNECTORS-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13592281#comment-13592281
]
Karl Wright commented on CONNECTORS-657:
----------------------------------------
bq. If we choose to extend RepositoryDocument, I recommend we should allow to
add any field as bq. Date object, i.e.:
bq. doc.addField( "last-modified", new Date() );
bq. doc.addField( "created", new Date() );
The reason I don't think this is sufficient is because each repository
connector will *still* have it's own (different) metadata name, so you still
get no commonality in the output connector. That is why, for common and global
kinds of information, I think we need explicit RepositoryDocument support. For
example:
{code}
doc.setModifiedDate(...);
doc.setCreatedDate(...);
{code}
We already do this for mime type and file name, for very similar reasons.
It would then be easy for an output connector (such as Solr) to map these
values to appropriate search-engine-specific formats.
For other kinds of date attributes that are NOT common across most
repositories, I have no problem also adding a new addField() variant as you
suggest. But this would not likely be used by any connectors at the moment.
> Normalize date/timestamps format across connectors
> --------------------------------------------------
>
> Key: CONNECTORS-657
> URL: https://issues.apache.org/jira/browse/CONNECTORS-657
> Project: ManifoldCF
> Issue Type: Bug
> Components: Framework crawler agent
> Affects Versions: ManifoldCF 1.2
> Reporter: Maciej Lizewski
> Assignee: Karl Wright
> Fix For: ManifoldCF 1.2
>
>
> several connectors add datetime attributes to RepositoryDocument, but they do
> not have common format. Examples:
> WikiConnector adds last-updated: 2010-10-10T12:34:00Z
> SharedDriveConnector adds last-updated: Thu May 28 17:39:46 CEST 2009
> and so on.
> Solr requires all date/datetime fields to be passed as YYYY-MM-DDTHH:II:SSZ
> We need to standardize formats (my recommendation is solr format) or allow to
> add Date attributes to RepositoryDocument and move formatting to
> OutputConnector.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira