[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043838#comment-16043838 ]
Hudson commented on TIKA-2362:
------------------------------
FAILURE: Integrated in Jenkins build Tika-trunk #1290 (See
[https://builds.apache.org/job/Tika-trunk/1290/])
TIKA-2362 -- Allow users to turn off extraction of headers and footers
(tallison:
[https://github.com/apache/tika/commit/5cbaed87235c2cee49c9d4fa15d84158d000e986])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/SXWPFWordExtractorDecorator.java
* (edit) CHANGES.txt
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFBExcelExtractorDecorator.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/WordParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XWPFWordExtractorDecorator.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ExcelExtractor.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/OfficeParserConfig.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/WordExtractor.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXWPFExtractorTest.java
> Skipping Header and Footer data from documents
> ----------------------------------------------
>
> Key: TIKA-2362
> URL: https://issues.apache.org/jira/browse/TIKA-2362
> Project: Tika
> Issue Type: Wish
> Components: general, handler
> Reporter: Mujahid Ateeb Khan
> Assignee: Tim Allison
> Priority: Trivial
>
> Is there any method to skip header and footer data of
> documents (pdf, docx, doc, odt)?
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)