[ 
https://issues.apache.org/jira/browse/CONNECTORS-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13592266#comment-13592266
 ] 

Maciej Lizewski commented on CONNECTORS-657:
--------------------------------------------

Unfortunately last-modified is sent to output connetors as it is added to 
RepositoryDocument with 'addField' in processDocument and all such fields are 
sent to output connector. Maybe most indexes ignore this attribute, but my 
configuration does not... I am ok with different timestamp formats used to 
create version strings, only those sent as metadata to output connectors make 
difference.

If we choose to extend RepositoryDocument, I recommend we should allow to add 
any field as Date object, i.e.:
doc.addField( "last-modified", new Date() );
doc.addField( "created", new Date() );

and then in OutputConnector you can cast it to string, format, or do whatever 
needed for connected index before posting. Anyway - if we add Date type fields 
(in general or only lastModified) there will be also incompatibility because 
you will have to drop value passed as "lastModified" and use only this passed 
as special attribute in RepositoryDocument...
                
> Normalize date/timestamps format across connectors
> --------------------------------------------------
>
>                 Key: CONNECTORS-657
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-657
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Framework crawler agent
>    Affects Versions: ManifoldCF 1.2
>            Reporter: Maciej Lizewski
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.2
>
>
> several connectors add datetime attributes to RepositoryDocument, but they do 
> not have common format. Examples:
> WikiConnector adds last-updated: 2010-10-10T12:34:00Z
> SharedDriveConnector adds last-updated: Thu May 28 17:39:46 CEST 2009
> and so on.
> Solr requires all date/datetime fields to be passed as YYYY-MM-DDTHH:II:SSZ
> We need to standardize formats (my recommendation is solr format) or allow to 
> add Date attributes to RepositoryDocument and move formatting to 
> OutputConnector.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to