Steve Gullion created TIKA-1440:
-----------------------------------

Summary: Auto-Paragraph numbers not extracted from Word Document
Key: TIKA-1440
URL: https://issues.apache.org/jira/browse/TIKA-1440
Project: Tika
Issue Type: Bug
Components: parser
Environment: Windows 7, Windows Server 2008, Tomcat
Reporter: Steve Gullion
Priority: Minor

When text is extracted from a Microsoft Word document that uses automatic paragraph numbering, the automatically generated numbers are missing from the output. Because the numbers can be critical to the meaning of the document (for example, when cross-references point to them), they should be calculated and extracted if at all possible.
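
To make "calculated" concrete, here is a rough sketch of how multi-level paragraph numbers could be derived from list levels alone. It is not Tika's or POI's implementation. The class AutoNumberSketch, the method labelsFor, the plain decimal "1.2." label format, and the nine-level cap are all assumptions made for illustration. Real Word numbering also depends on per-list formats, start values, and restart rules, which this ignores.

```java
import java.util.ArrayList;
import java.util.List;

public class AutoNumberSketch {

    // Assumed cap on nesting depth for this sketch (levels 0..8).
    private static final int MAX_LEVELS = 9;

    /**
     * Hypothetical helper: given the zero-based list level of each numbered
     * paragraph in document order, return decimal labels such as "1.", "1.1.".
     * Assumes levels never skip a parent (a level-1 item always follows a
     * level-0 item at some point earlier in the list).
     */
    static List<String> labelsFor(int[] levels) {
        int[] counters = new int[MAX_LEVELS];
        List<String> labels = new ArrayList<>();
        for (int level : levels) {
            // Advance the counter for this paragraph's own level.
            counters[level]++;
            // Any deeper level restarts when a shallower item appears.
            for (int i = level + 1; i < MAX_LEVELS; i++) {
                counters[i] = 0;
            }
            // Join the counters from the top level down to this level.
            StringBuilder label = new StringBuilder();
            for (int i = 0; i <= level; i++) {
                label.append(counters[i]).append('.');
            }
            labels.add(label.toString());
        }
        return labels;
    }

    public static void main(String[] args) {
        // Two top-level items, the first with two children, the second with one.
        int[] levels = {0, 1, 1, 0, 1};
        for (String label : labelsFor(levels)) {
            System.out.println(label);
        }
    }
}
```

An extractor that tracked counters like this per list could prepend the resulting label to each paragraph's text, so that a cross-reference such as "see paragraph 1.2" would still resolve in the extracted output.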