dependabot[bot] opened a new pull request, #1662:
URL: https://github.com/apache/tika/pull/1662
[![Dependabot compatibility
dependabot[bot] opened a new pull request, #1661:
URL: https://github.com/apache/tika/pull/1661
Bumps `aws.version` from 1.12.679 to 1.12.680.
Updates `com.amazonaws:aws-java-sdk-s3` from 1.12.679 to 1.12.680
Changelog
Sourced from
dependabot[bot] opened a new pull request, #1660:
URL: https://github.com/apache/tika/pull/1660
Bumps `pdfbox.version` from 3.0.1 to 3.0.2.
Updates `org.apache.pdfbox:xmpbox` from 3.0.1 to 3.0.2
Updates `org.apache.pdfbox:fontbox` from 3.0.1 to 3.0.2
Updates
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827248#comment-17827248
]
Hudson commented on TIKA-4166:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1555 (See
Tim Allison created TIKA-4213:
-
Summary: Improvements to jdbc pipes reporter
Key: TIKA-4213
URL: https://issues.apache.org/jira/browse/TIKA-4213
Project: Tika
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827241#comment-17827241
]
Tim Allison edited comment on TIKA-4211 at 3/14/24 8:20 PM:
Step 3: Is there
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827230#comment-17827230
]
Tim Allison edited comment on TIKA-4211 at 3/14/24 8:17 PM:
Step 2: In this
[ https://issues.apache.org/jira/browse/TIKA-4211 ]
Tim Allison deleted comment on TIKA-4211:
---
was (Author: talli...@mitre.org):
Or, if you grep for "embeddings" in the in uncompressed zip, can you find a
link to the xlsx file?
> Tika extractor
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827241#comment-17827241
]
Tim Allison commented on TIKA-4211:
---
Step 3: Is there something like this in /ppt/slides/slide2.xml:
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827233#comment-17827233
]
Tim Allison commented on TIKA-4211:
---
Or, if you grep for "embeddings" in the in uncompressed zip, can
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827230#comment-17827230
]
Tim Allison commented on TIKA-4211:
---
In this file within the zip: /ppt/slides/_rels/slide2.xml.rels:
Do
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827221#comment-17827221
]
Xiaohong Yang commented on TIKA-4211:
-
Hi Tim,
Yes, I found the right file
[
https://issues.apache.org/jira/browse/TIKA-4212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiaohong Yang updated TIKA-4212:
Attachment: tika-config-and-sample-file.zip
> Tika fails to get file extension of file type
Xiaohong Yang created TIKA-4212:
---
Summary: Tika fails to get file extension of file type
image/x-rtf-raw-bitmap
Key: TIKA-4212
URL: https://issues.apache.org/jira/browse/TIKA-4212
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827193#comment-17827193
]
Tim Allison commented on TIKA-4210:
---
Those files look like this in the rtf file:
{code:java}
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827191#comment-17827191
]
Tim Allison commented on TIKA-4210:
---
Nick is right. The file is an RTF file. Tika does find two embedded
[
https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827190#comment-17827190
]
Tim Allison commented on TIKA-4211:
---
Y, as you point out, Tika works with the example file that you
Xiaohong Yang created TIKA-4211:
---
Summary: Tika extractor fails to extract embedded excel from pptx
Key: TIKA-4211
URL: https://issues.apache.org/jira/browse/TIKA-4211
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827036#comment-17827036
]
Tika User commented on TIKA-4210:
-
The attached file is doc extension and from that file it should detect
dependabot[bot] commented on PR #1659:
URL: https://github.com/apache/tika/pull/1659#issuecomment-1997093738
OK, I won't notify you again about this release, but will get in touch when
a new version is available. If you'd rather skip all updates until the next
major or minor version, let
THausherr closed pull request #1659: Bump com.google.protobuf:protobuf-java
from 3.25.3 to 4.26.0
URL: https://github.com/apache/tika/pull/1659
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-4210:
Description:
Hi Team,
The attached embedded file contain .MPGA attachments which tika is not able to
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827017#comment-17827017
]
Nick Burch commented on TIKA-4210:
--
The attached file seems to be an RTF file. I'm not sure what a ".mega
Tika User created TIKA-4210:
---
Summary: Not able to identify tika extension
Key: TIKA-4210
URL: https://issues.apache.org/jira/browse/TIKA-4210
Project: Tika
Issue Type: Bug
Reporter:
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17826996#comment-17826996
]
Tilman Hausherr commented on TIKA-4199:
---
The original error you reported wasn't really a bug in
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17826992#comment-17826992
]
Alexander Veit commented on TIKA-4199:
--
The same error also occurs with Tika 2.9.1 and
THausherr merged PR #1657:
URL: https://github.com/apache/tika/pull/1657
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail:
THausherr merged PR #1658:
URL: https://github.com/apache/tika/pull/1658
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail:
28 matches
Mail list logo