[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16791983#comment-16791983
]
Tim Allison commented on TIKA-2362:
---
We _could_ add this, but it feels kind of application specific. If
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790441#comment-16790441
]
Rajesh Rajamani commented on TIKA-2362:
---
Hi,
Just wanted to see if can have options to specify x-y
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499778#comment-16499778
]
Vivek commented on TIKA-2362:
-
Request team to kindly include header and footer removal from .pdf, .rtf and
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043863#comment-16043863
]
Mujahid Ateeb Khan commented on TIKA-2362:
--
In odt file header data display at end of String.
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043838#comment-16043838
]
Hudson commented on TIKA-2362:
--
FAILURE: Integrated in Jenkins build Tika-trunk #1290 (See
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043797#comment-16043797
]
Tim Allison commented on TIKA-2362:
---
I added configurability for doc, docx, xls and xlsx.
PDFs are
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16014299#comment-16014299
]
Mujahid Ateeb Khan commented on TIKA-2362:
--
I tried with odt doc docx and pdf format...
>
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16014059#comment-16014059
]
Nick Burch commented on TIKA-2362:
--
Which format(s) are you having that problem with? Is that all
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16013488#comment-16013488
]
Mujahid Ateeb Khan commented on TIKA-2362:
--
Yes I tried that method using XHTML handler but some
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012245#comment-16012245
]
Thejan Wijesinghe commented on TIKA-2362:
-
Can't we use regular expressions to detect headers &
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012244#comment-16012244
]
Mujahid Ateeb Khan commented on TIKA-2362:
--
Is there any alternate way to skip headers and footers
[
https://issues.apache.org/jira/browse/TIKA-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16012238#comment-16012238
]
Tim Allison commented on TIKA-2362:
---
There isn't, and it shouldn't be hard to add. Prob won't make it
12 matches
Mail list logo