[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2019-03-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16791983#comment-16791983 ] Tim Allison commented on TIKA-2362: --- We _could_ add this, but it feels kind of application specific. If

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2019-03-12 Thread Rajesh Rajamani (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790441#comment-16790441 ] Rajesh Rajamani commented on TIKA-2362: --- Hi, Just wanted to see if can have options to specify x-y

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2018-06-03 Thread Vivek (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499778#comment-16499778 ] Vivek commented on TIKA-2362: - Request team to kindly include header and footer removal from .pdf, .rtf and

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-06-08 Thread Mujahid Ateeb Khan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043863#comment-16043863 ] Mujahid Ateeb Khan commented on TIKA-2362: -- In odt file header data display at end of String.

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043838#comment-16043838 ] Hudson commented on TIKA-2362: -- FAILURE: Integrated in Jenkins build Tika-trunk #1290 (See

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-06-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043797#comment-16043797 ] Tim Allison commented on TIKA-2362: --- I added configurability for doc, docx, xls and xlsx. PDFs are

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-17 Thread Mujahid Ateeb Khan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16014299#comment-16014299 ] Mujahid Ateeb Khan commented on TIKA-2362: -- I tried with odt doc docx and pdf format... >

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-17 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16014059#comment-16014059 ] Nick Burch commented on TIKA-2362: -- Which format(s) are you having that problem with? Is that all

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-16 Thread Mujahid Ateeb Khan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16013488#comment-16013488 ] Mujahid Ateeb Khan commented on TIKA-2362: -- Yes I tried that method using XHTML handler but some

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-16 Thread Thejan Wijesinghe (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012245#comment-16012245 ] Thejan Wijesinghe commented on TIKA-2362: - Can't we use regular expressions to detect headers &

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-16 Thread Mujahid Ateeb Khan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012244#comment-16012244 ] Mujahid Ateeb Khan commented on TIKA-2362: -- Is there any alternate way to skip headers and footers

[jira] [Commented] (TIKA-2362) Skipping Header and Footer data from documents

2017-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012238#comment-16012238 ] Tim Allison commented on TIKA-2362: --- There isn't, and it shouldn't be hard to add. Prob won't make it