Tim Allison created TIKA-1124:
-
Summary: Nested documents not extracted if a PDF file is in the
chain
Key: TIKA-1124
URL: https://issues.apache.org/jira/browse/TIKA-1124
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1124:
--
Attachment: pdf_attachment_issues.zip
outer.docx contains the attached.pdf, which itself contains an
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13676495#comment-13676495
]
Tim Allison commented on TIKA-1130:
---
I've submitted a patch to POI for this
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677957#comment-13677957
]
Tim Allison commented on TIKA-1130:
---
I'll try to submit the Tika portion of the POI-54849
[
https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13682628#comment-13682628
]
Tim Allison commented on TIKA-1132:
---
Tika gui took longer than I was willing to wait,
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13692110#comment-13692110
]
Tim Allison commented on TIKA-1130:
---
Nick,
I think I have to make modifications to
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13692517#comment-13692517
]
Tim Allison commented on TIKA-1130:
---
Maven proxy setting in my settings.xml file is
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13693068#comment-13693068
]
Tim Allison commented on TIKA-973:
--
Will submit patch and tests by end of the week.
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1130:
--
Attachment: TIKA-1130.patch
Ray's initial test restored after POI-55142 was committed. Thank you,
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-973:
-
Attachment: TIKA-973-patch.tar.gz
Patch attached. Dumps contents of pdf forms at end of document.
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694774#comment-13694774
]
Tim Allison commented on TIKA-973:
--
Agree on both. Also would appreciate feedback on what
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-973:
-
Attachment: i-9_screenshot.png
Screenshot attached. Thanks again to:
Tim Allison created TIKA-1139:
-
Summary: Modify Tika-1129 to test against a local file
Key: TIKA-1139
URL: https://issues.apache.org/jira/browse/TIKA-1139
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1139:
--
Attachment: TIKA-1139.patch.tar.gz
Patch attached.
Modify Tika-1129 to test against a
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-973:
-
Attachment: TIKA-973.patch.tar.gz
Middle-road change made. The alternate name is an attribute and partial
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13698009#comment-13698009
]
Tim Allison commented on TIKA-1130:
---
That was fast. Thank you!
.docx
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13704699#comment-13704699
]
Tim Allison commented on TIKA-1130:
---
Haven't had a chance to build from trunk today, but
[
https://issues.apache.org/jira/browse/TIKA-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13704711#comment-13704711
]
Tim Allison commented on TIKA-1130:
---
Tested with freshly built trunk, and the text looks
Tim Allison created TIKA-1150:
-
Summary: Extract text from textbox in XLSX
Key: TIKA-1150
URL: https://issues.apache.org/jira/browse/TIKA-1150
Project: Tika
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/TIKA-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1150:
--
Attachment: testEXCEL_textbox.xlsx
Simple file that shows issue.
Extract text from
[
https://issues.apache.org/jira/browse/TIKA-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13716429#comment-13716429
]
Tim Allison commented on TIKA-1150:
---
Duplicate of
[
https://issues.apache.org/jira/browse/TIKA-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison closed TIKA-1150.
-
Resolution: Duplicate
Extract text from textbox in XLSX
-
[
https://issues.apache.org/jira/browse/TIKA-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13716432#comment-13716432
]
Tim Allison commented on TIKA-1100:
---
Waiting for improvements in POI-55292. Will make
[
https://issues.apache.org/jira/browse/TIKA-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1100:
--
Attachment: testEXCEL_textbox.xlsx
Simple example file attached for now. Will fill out with test cases
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-792:
-
Attachment: test10.docx
Example document that triggers no such method exceptions for:
CTMarkupRangeImpl,
[
https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13729781#comment-13729781
]
Tim Allison commented on TIKA-1124:
---
If anyone has a chance to look into this, I'd
[
https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13729903#comment-13729903
]
Tim Allison commented on TIKA-1124:
---
Ok, I think I figured this out... AbstractOOXML
[
https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1124:
--
Attachment: TIKA-1124.patch
Chose to move embedded file code into PDF2XHTML. This allows the proper
[
https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison closed TIKA-1124.
-
Resolution: Fixed
Fix Version/s: 1.5
Added tests (thanks to Nick's advice to use model of
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733804#comment-13733804
]
Tim Allison commented on TIKA-792:
--
Committed in POI. Once POI3.9beta2 is released, I'll
[
https://issues.apache.org/jira/browse/TIKA-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13736875#comment-13736875
]
Tim Allison commented on TIKA-1153:
---
Fellow Tika committers, I made this change locally
[
https://issues.apache.org/jira/browse/TIKA-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1001:
--
Attachment: TIKA-1001v1.tar.gz
This is a draft that simplifies the extraction of the charset attribute
[
https://issues.apache.org/jira/browse/TIKA-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13740580#comment-13740580
]
Tim Allison commented on TIKA-1001:
---
Fixed as of r1514126. Thank you for submitting this
[
https://issues.apache.org/jira/browse/TIKA-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13741785#comment-13741785
]
Tim Allison commented on TIKA-1153:
---
Committed as of r1514551.
Upgrade
[
https://issues.apache.org/jira/browse/TIKA-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13741785#comment-13741785
]
Tim Allison commented on TIKA-1153:
---
Committed as of r1514551.
Upgrade
[
https://issues.apache.org/jira/browse/TIKA-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison closed TIKA-1001.
-
tika no longer seems to honor HTTP meta tag for arabic text in ISO-8859-6
charset
[
https://issues.apache.org/jira/browse/TIKA-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison closed TIKA-1153.
-
Resolution: Fixed
Upgrade pdfbox to latest 1.8.2 version
--
[
https://issues.apache.org/jira/browse/TIKA-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1001.
---
Resolution: Fixed
tika no longer seems to honor HTTP meta tag for arabic text in ISO-8859-6
[
https://issues.apache.org/jira/browse/TIKA-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13742129#comment-13742129
]
Tim Allison commented on TIKA-1162:
---
Would you be willing to attach a document/test case
[
https://issues.apache.org/jira/browse/TIKA-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13742266#comment-13742266
]
Tim Allison commented on TIKA-1001:
---
David,
Thank you for submitting this. I fixed the
[
https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-1132:
---
Assignee: Tim Allison
Will add test case in Tika.
Parsing some XLS documents
Tim Allison created TIKA-1173:
-
Summary: Upgrade to POI-3.10-beta2
Key: TIKA-1173
URL: https://issues.apache.org/jira/browse/TIKA-1173
Project: Tika
Issue Type: Improvement
Reporter:
[
https://issues.apache.org/jira/browse/TIKA-1173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1173.
---
Resolution: Fixed
Upgrade to POI-3.10-beta2
-
Key:
[
https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13772116#comment-13772116
]
Tim Allison edited comment on TIKA-1132 at 9/20/13 5:35 PM:
Any
[
https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13772116#comment-13772116
]
Tim Allison edited comment on TIKA-1132 at 9/20/13 5:36 PM:
Any
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13773212#comment-13773212
]
Tim Allison commented on TIKA-792:
--
This is now fixed by TIKA-1173.
Can anyone recommend
[
https://issues.apache.org/jira/browse/TIKA-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778801#comment-13778801
]
Tim Allison commented on TIKA-1100:
---
Updated XSSFExcelExtractorDecorator and added test
[
https://issues.apache.org/jira/browse/TIKA-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1100.
---
Resolution: Fixed
Fix Version/s: 1.5
r1526498
cannot extract text in
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-792:
--
added test that catches stderr.
r1526570.
reopening just to record this.
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-792.
--
Resolution: Fixed
NoSuchMethodException CTMarkupImpl.init(org.apache.xmlbeans.SchemaType,
[
https://issues.apache.org/jira/browse/TIKA-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1132.
---
Resolution: Fixed
Resolved with upgrade to poi-3.10-beta2.
Could use help getting jUnit's timeout to
[
https://issues.apache.org/jira/browse/TIKA-1076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1076.
---
Resolution: Fixed
Added some code similar to the fix to POI-54722 to HSLFExtractor. Uncommented
old
[
https://issues.apache.org/jira/browse/TIKA-817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-817.
--
Resolution: Fixed
As mentioned above, this was fixed a while ago. I added test documents from
[
https://issues.apache.org/jira/browse/TIKA-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13781922#comment-13781922
]
Tim Allison commented on TIKA-1162:
---
Dear Colleague,
I'm on paternity leave. Will be
[
https://issues.apache.org/jira/browse/TIKA-817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13811201#comment-13811201
]
Tim Allison commented on TIKA-817:
--
Thank you!
(PPT/PPTX) Missing date/time in text
[
https://issues.apache.org/jira/browse/TIKA-1200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1200.
---
Resolution: Fixed
Fixed in r1547037. Waiting for Jenkins to pick up change to confirm. Thank
you!
[
https://issues.apache.org/jira/browse/TIKA-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reassigned TIKA-1201:
-
Assignee: Tim Allison
Add possibility for switching to pdfbox NonSequentialPDFParser
[
https://issues.apache.org/jira/browse/TIKA-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1201:
--
Attachment: TIKA-1201.patch
Trivial patch
Add possibility for switching to pdfbox
[
https://issues.apache.org/jira/browse/TIKA-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1201.
---
Resolution: Fixed
Fix Version/s: 1.5
Basic parameter-based capability added in r1547250. User
Tim Allison created TIKA-1202:
-
Summary: Refactor PDFParser to enable easier parameter setting
Key: TIKA-1202
URL: https://issues.apache.org/jira/browse/TIKA-1202
Project: Tika
Issue Type:
[
https://issues.apache.org/jira/browse/TIKA-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1202:
--
Attachment: TIKA-1202.patch
Would appreciate community feedback on this before I commit it (December
Tim Allison created TIKA-1203:
-
Summary: Some metadata not extracted from PDF files when
NonSequentialPDFParser is used
Key: TIKA-1203
URL: https://issues.apache.org/jira/browse/TIKA-1203
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837169#comment-13837169
]
Tim Allison edited comment on TIKA-1201 at 12/3/13 4:25 PM:
[
https://issues.apache.org/jira/browse/TIKA-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838856#comment-13838856
]
Tim Allison commented on TIKA-1199:
---
Doh! Duplicated Marc's PDFBOX-1783. Sorry about
[
https://issues.apache.org/jira/browse/TIKA-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1202.
---
Resolution: Fixed
Fix Version/s: 1.5
Committed in r1548700. Thank you, Mike and Hong-Thai for
[
https://issues.apache.org/jira/browse/TIKA-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-1202:
---
Small bug in using default vs config.
Refactor PDFParser to enable easier parameter setting
[
https://issues.apache.org/jira/browse/TIKA-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1202.
---
Resolution: Fixed
r1549646
Refactor PDFParser to enable easier parameter setting
Tim Allison created TIKA-1205:
-
Summary: Allow PDFParser to fallback to other parser if there is
an exception
Key: TIKA-1205
URL: https://issues.apache.org/jira/browse/TIKA-1205
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13845429#comment-13845429
]
Tim Allison commented on TIKA-1205:
---
Thank you for your feedback! TIKA-456 is the
[
https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-973:
--
Assignee: Tim Allison
In hindsight, would prefer to use test documents that are unequivocally
[
https://issues.apache.org/jira/browse/TIKA-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13852938#comment-13852938
]
Tim Allison commented on TIKA-1212:
---
On first issue: do you mean that you'd like to have
[
https://issues.apache.org/jira/browse/TIKA-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1212:
--
Attachment: abc.zip
Does this test file meet your description?
Recursive Extraction of Archive File
[
https://issues.apache.org/jira/browse/TIKA-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1205:
--
Due Date: 17/Jan/14 (was: 20/Dec/13)
Allow PDFParser to fallback to other parser if there is an
[
https://issues.apache.org/jira/browse/TIKA-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13864393#comment-13864393
]
Tim Allison commented on TIKA-1216:
---
Give this a shot:
[
https://issues.apache.org/jira/browse/TIKA-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1216.
---
Resolution: Fixed
Fix Version/s: 1.5
Following reporter's comment, this looks to be fixed in
[
https://issues.apache.org/jira/browse/TIKA-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13866916#comment-13866916
]
Tim Allison commented on TIKA-1216:
---
Agreed. I didn't think this was a duplicate. It is
[
https://issues.apache.org/jira/browse/TIKA-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13869528#comment-13869528
]
Tim Allison commented on TIKA-1215:
---
[~thaichat04] thank you for sending a clean patch.
[
https://issues.apache.org/jira/browse/TIKA-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reassigned TIKA-1226:
-
Assignee: Tim Allison
PDFTextStripper fails while getting data of PDF form fields of type
[
https://issues.apache.org/jira/browse/TIKA-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13880130#comment-13880130
]
Tim Allison commented on TIKA-1226:
---
Eric,
Thank you for reporting this. I'll make the
[
https://issues.apache.org/jira/browse/TIKA-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13880273#comment-13880273
]
Tim Allison commented on TIKA-1226:
---
How about we grab the name?
{noformat}
[
https://issues.apache.org/jira/browse/TIKA-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13881383#comment-13881383
]
Tim Allison commented on TIKA-1226:
---
Thank you for the test file. I'll use that in the
[
https://issues.apache.org/jira/browse/TIKA-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13881383#comment-13881383
]
Tim Allison edited comment on TIKA-1226 at 1/24/14 8:22 PM:
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889697#comment-13889697
]
Tim Allison commented on TIKA-1228:
---
I won't have time to fix this for a week or so, but
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889697#comment-13889697
]
Tim Allison edited comment on TIKA-1228 at 2/3/14 6:09 PM:
---
I
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889697#comment-13889697
]
Tim Allison edited comment on TIKA-1228 at 2/3/14 6:11 PM:
---
I
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1228.
---
Resolution: Fixed
Fix Version/s: 1.5
Fixed in r1564042.
Thank you, [~agi20dla], for reporting
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1228:
--
Comment: was deleted
(was: I won't have time to fix this for a week or so, but, I'll take this
unless
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890605#comment-13890605
]
Tim Allison commented on TIKA-1228:
---
Not sure I understand. Is this the snippet that you
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890610#comment-13890610
]
Tim Allison commented on TIKA-1228:
---
Y. That's the point of open source. :) Enjoy!
Now
[
https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890613#comment-13890613
]
Tim Allison commented on TIKA-1228:
---
Ok, to confirm, the PDNameTreeNode class cast
Tim Allison created TIKA-1230:
-
Summary: Update PDFBox to v1.8.4
Key: TIKA-1230
URL: https://issues.apache.org/jira/browse/TIKA-1230
Project: Tika
Issue Type: Improvement
Affects Versions:
[
https://issues.apache.org/jira/browse/TIKA-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1230.
---
Resolution: Fixed
r1564335
Update PDFBox to v1.8.4
---
Key:
Tim Allison created TIKA-1231:
-
Summary: Safely handle null embedded files in PDFs
Key: TIKA-1231
URL: https://issues.apache.org/jira/browse/TIKA-1231
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reassigned TIKA-1232:
-
Assignee: Tim Allison
Add PDF version to PDFParser output
---
[
https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13892146#comment-13892146
]
Tim Allison commented on TIKA-1232:
---
How about Application-Version to follow the
Tim Allison created TIKA-1233:
-
Summary: PDFBox can throw StringIndexOutOfBoundsException on some
dates
Key: TIKA-1233
URL: https://issues.apache.org/jira/browse/TIKA-1233
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13893380#comment-13893380
]
Tim Allison commented on TIKA-1232:
---
Interesting. Thank you, [~johanvanderknijff] and
[
https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13893380#comment-13893380
]
Tim Allison edited comment on TIKA-1232 at 2/6/14 2:31 PM:
---
[
https://issues.apache.org/jira/browse/TIKA-1232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13893426#comment-13893426
]
Tim Allison commented on TIKA-1232:
---
[~anjackson], y, I'd like to add your code if others
[
https://issues.apache.org/jira/browse/TIKA-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1233:
--
Description:
PDFBOX's date parser can throw a StringIndexOutOfBoundsException if a date
string for
1 - 100 of 8779 matches
Mail list logo