[ 
https://issues.apache.org/jira/browse/TIKA-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14940009#comment-14940009
 ] 

Tim Allison commented on TIKA-1759:
-----------------------------------

[~tilman], I think we're set on using PDAnnotationMarkup's {{getTitlePopup()}} 
to extract what I'm currently calling "commentAuthor", and PDSignature's 
{{getName()}} to extract "digitalSigner".
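
As a rough illustration of how those two values might be gathered into 
multi-valued metadata, here is a minimal sketch. {{ContributorCollector}} is a 
hypothetical helper, not part of Tika or PDFBox, and the sample names stand in 
for whatever {{getTitlePopup()}} and {{getName()}} would return for a real 
document. It assumes we want one entry per distinct name and that either call 
may yield null or an empty string.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper (not a Tika or PDFBox class): collects contributor
// names per metadata key, keeping first-seen order and dropping duplicates.
class ContributorCollector {
    private final Map<String, Set<String>> values = new LinkedHashMap<>();

    // Records one name under the given key; null or blank names are skipped,
    // since an annotation or signature may not carry the field at all.
    void add(String key, String name) {
        if (name == null || name.trim().isEmpty()) {
            return;
        }
        values.computeIfAbsent(key, k -> new LinkedHashSet<>()).add(name.trim());
    }

    // Returns the names recorded under the key, or an empty list if none.
    List<String> get(String key) {
        Set<String> names = values.get(key);
        return names == null ? new ArrayList<>() : new ArrayList<>(names);
    }

    public static void main(String[] args) {
        ContributorCollector c = new ContributorCollector();
        // Stand-ins for values read via PDAnnotationMarkup.getTitlePopup()
        c.add("commentAuthor", "Alice");
        c.add("commentAuthor", "Bob");
        c.add("commentAuthor", "Alice"); // repeated author, stored once
        c.add("commentAuthor", null);    // annotation without a title
        // Stand-in for a value read via PDSignature.getName()
        c.add("digitalSigner", "Carol");
        System.out.println("commentAuthor: " + c.get("commentAuthor"));
        System.out.println("digitalSigner: " + c.get("digitalSigner"));
    }
}
```

Whether duplicates should be collapsed like this, or every occurrence kept, is 
a design choice the sketch makes arbitrarily.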

How could we extract authorship information from different versions of a PDF 
(the equivalent of MSOffice's track changes)? 

> Extract contributor metadata from supporting file formats
> ---------------------------------------------------------
>
>                 Key: TIKA-1759
>                 URL: https://issues.apache.org/jira/browse/TIKA-1759
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: contributors.zip
>
>
> Many common file formats store information about contributors (broadly 
> speaking) to a document.  We are currently extracting author/creator and 
> modifier/last author.  Let's add extraction for:
> # comment authors
> # revisers (authors who make changes with track changes on)
> # signers



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
