[ 
https://issues.apache.org/jira/browse/TIKA-1440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14380393#comment-14380393
 ] 

Tim Allison commented on TIKA-1440:
-----------------------------------

Able to post a mock-up document and expected output?  Can't tell if we'll be 
able to do this at the Tika level or if we'll need mods to POI.

> Auto-Paragraph numbers not extracted from Word Document 
> --------------------------------------------------------
>
>                 Key: TIKA-1440
>                 URL: https://issues.apache.org/jira/browse/TIKA-1440
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>         Environment: Windows 7, Windows Server 2008, Tomcat
>            Reporter: Steve Gullion
>            Priority: Minor
>              Labels: numbering, paragraph, word
>
> When the text is extracted from a Microsoft Word document that uses automatic 
> numbering, the text of the automatic numbers is not extracted. As the numbers 
> can be critical to the meaning of the document (as in the case of 
> cross-references), they should be calculated and extracted if at all possible.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to