[
https://issues.apache.org/jira/browse/TIKA-4170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846142#comment-17846142
]
Tika User commented on TIKA-4170:
-
Any update on this?
> Tika to extract Apple Key files
>
[
https://issues.apache.org/jira/browse/TIKA-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843211#comment-17843211
]
Tika User commented on TIKA-4249:
-
[~tallison] Any idea when the next release date is?
> EML file is
[
https://issues.apache.org/jira/browse/TIKA-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17842716#comment-17842716
]
Tika User edited comment on TIKA-4249 at 5/1/24 4:38 PM:
-
[~tallison] May I know
[
https://issues.apache.org/jira/browse/TIKA-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17842716#comment-17842716
]
Tika User commented on TIKA-4249:
-
May I know when these changes will be available? I would like to know the version
[
https://issues.apache.org/jira/browse/TIKA-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-4249:
Description: We recently upgraded from 2.9.0 to 2.9.2. After upgrading, we found that
the attached file is treating
[
https://issues.apache.org/jira/browse/TIKA-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-4249:
Attachment: (was: Email_Received.txt)
> EML file is treating it as text file in 3.9.2 version
>
Tika User created TIKA-4249:
---
Summary: EML file is treating it as text file in 3.9.2 version
Key: TIKA-4249
URL: https://issues.apache.org/jira/browse/TIKA-4249
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17827036#comment-17827036
]
Tika User commented on TIKA-4210:
-
The attached file has a .doc extension, and from that file it should detect
[
https://issues.apache.org/jira/browse/TIKA-4210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-4210:
Description:
Hi Team,
The attached embedded file contains .MPGA attachments which Tika is not able to
Tika User created TIKA-4210:
---
Summary: Not able to identify tika extension
Key: TIKA-4210
URL: https://issues.apache.org/jira/browse/TIKA-4210
Project: Tika
Issue Type: Bug
Reporter:
[
https://issues.apache.org/jira/browse/TIKA-4170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17787819#comment-17787819
]
Tika User commented on TIKA-4170:
-
Hi Allison,
Our observation is that Tika is not extracting
Tika User created TIKA-4170:
---
Summary: Tika to extract Apple Key files
Key: TIKA-4170
URL: https://issues.apache.org/jira/browse/TIKA-4170
Project: Tika
Issue Type: Bug
Reporter: Tika
[
https://issues.apache.org/jira/browse/TIKA-3981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3981:
Attachment: Tika_Testing.docx
> Tika parser meets window system file
>
[
https://issues.apache.org/jira/browse/TIKA-3981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17694330#comment-17694330
]
Tika User commented on TIKA-3981:
-
Hi [~nick] ,
Only the special files, existed in the
Tika User created TIKA-3981:
---
Summary: Tika parser meets window system file
Key: TIKA-3981
URL: https://issues.apache.org/jira/browse/TIKA-3981
Project: Tika
Issue Type: Bug
Reporter:
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User closed TIKA-3953.
---
> Message class name value is coding differently
> --
>
>
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675689#comment-17675689
]
Tika User commented on TIKA-3953:
-
Understood that this is not related to Tika. Closing the issue.
> Message
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User resolved TIKA-3953.
-
Resolution: Fixed
> Message class name value is coding differently
>
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3953:
Summary: Message class name value is coding differently (was: Message
class value is coding differently)
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3953:
Description:
The message class name value loads differently between 2.4.1 and 2.6.0.
Message class
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3953:
Attachment: Note.msg
> Message class value is coding differently
>
[
https://issues.apache.org/jira/browse/TIKA-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675659#comment-17675659
]
Tika User commented on TIKA-3953:
-
[~tallison] Attached note file. [^Note.msg]
> Message class value is
Tika User created TIKA-3953:
---
Summary: Message class value is coding differently
Key: TIKA-3953
URL: https://issues.apache.org/jira/browse/TIKA-3953
Project: Tika
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User resolved TIKA-3952.
-
Resolution: Fixed
> Content mismatch
> -
>
> Key: TIKA-3952
>
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17665545#comment-17665545
]
Tika User commented on TIKA-3952:
-
Got it. Thanks
> Content mismatch
> -
>
>
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656063#comment-17656063
]
Tika User edited comment on TIKA-3952 at 1/10/23 12:43 PM:
---
[~nick] FYI. I
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656062#comment-17656062
]
Tika User commented on TIKA-3952:
-
We are not doing any OCR for this. Simple native file and getting all
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656063#comment-17656063
]
Tika User commented on TIKA-3952:
-
FYI. I attached PDF file for your reference.
> Content mismatch
>
[
https://issues.apache.org/jira/browse/TIKA-3952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656059#comment-17656059
]
Tika User commented on TIKA-3952:
-
[~nick] I ran this command :
java -jar pdfbox-app.2.0.27.jar
Tika User created TIKA-3952:
---
Summary: Content mismatch
Key: TIKA-3952
URL: https://issues.apache.org/jira/browse/TIKA-3952
Project: Tika
Issue Type: Bug
Affects Versions: 2.6.0
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17576563#comment-17576563
]
Tika User commented on TIKA-3827:
-
[~tallison] Image data documentation.
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575945#comment-17575945
]
Tika User edited comment on TIKA-3827 at 8/5/22 11:20 PM:
--
Okay
was (Author:
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575945#comment-17575945
]
Tika User commented on TIKA-3827:
-
When will this fix be available? In the next version?
> Word Document
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575881#comment-17575881
]
Tika User commented on TIKA-3827:
-
Below is the code:
You can easily extract text from the document
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575281#comment-17575281
]
Tika User edited comment on TIKA-3827 at 8/4/22 1:57 PM:
-
[^example.zip]
Attached
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575281#comment-17575281
]
Tika User commented on TIKA-3827:
-
[^example.zip]
Attached the zip file, the content, and two PNG files.
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3827:
Attachment: example.zip
> Word Document extracted mpga file extension instead of bitmap
>
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575208#comment-17575208
]
Tika User edited comment on TIKA-3827 at 8/4/22 12:57 PM:
--
[~tallison] I tried
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575208#comment-17575208
]
Tika User commented on TIKA-3827:
-
I tried the same file using the link below, and both the attachments
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575193#comment-17575193
]
Tika User commented on TIKA-3827:
-
But if we look into the original document we are seeing those images.
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3827:
Attachment: image-2022-08-04-15-45-10-892.png
> Word Document extracted mpga file extension instead of
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3827:
Attachment: image-2022-08-04-15-44-48-396.png
> Word Document extracted mpga file extension instead of
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575047#comment-17575047
]
Tika User commented on TIKA-3827:
-
When I tried to open the extracted file in Paint, I am seeing the
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3827:
Attachment: image-2022-08-04-10-53-48-894.png
> Word Document extracted mpga file extension instead of
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3827:
Attachment: image-2022-08-04-10-52-44-800.png
> Word Document extracted mpga file extension instead of
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574765#comment-17574765
]
Tika User commented on TIKA-3827:
-
I think both because when I process extracted document separately it is
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574757#comment-17574757
]
Tika User commented on TIKA-3827:
-
I think based on above document, I can say that from Tika should turn
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574695#comment-17574695
]
Tika User edited comment on TIKA-3827 at 8/3/22 12:17 PM:
--
It is reading it as
[
https://issues.apache.org/jira/browse/TIKA-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574695#comment-17574695
]
Tika User commented on TIKA-3827:
-
Its file type is being read as RF, and while extracting the content
Tika User created TIKA-3827:
---
Summary: Word Document extracted mpga file extension instead of
bitmap
Key: TIKA-3827
URL: https://issues.apache.org/jira/browse/TIKA-3827
Project: Tika
Issue Type:
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User closed TIKA-3647.
---
Resolution: Fixed
> Failed to get content and metadata for .hwp files
>
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3647:
Attachment: (was: test.hwp)
> Failed to get content and metadata for .hwp files
>
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17475346#comment-17475346
]
Tika User commented on TIKA-3647:
-
Yep. The included detector is working fine now. Thanks.
Closing the issue
>
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17475329#comment-17475329
]
Tika User commented on TIKA-3647:
-
Entire metadata is missing.
getting only two :
X-TIKA:Parsed-By :
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3647:
Attachment: test.hwp
> Failed to get content and metadata for .hwp files
>
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3647:
Attachment: (was: P1.PC.0071.hwp)
> Failed to get content and metadata for .hwp files
>
[
https://issues.apache.org/jira/browse/TIKA-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3647:
Attachment: P1.PC.0071.hwp
> Failed to get content and metadata for .hwp files
>
Tika User created TIKA-3647:
---
Summary: Failed to get content and metadata for .hwp files
Key: TIKA-3647
URL: https://issues.apache.org/jira/browse/TIKA-3647
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apachae Tika User updated TIKA-3646:
Description:
I was using the ScreenToGif tool, which allows recording the screen and creating GIFs or
Apachae Tika User created TIKA-3646:
---
Summary: MP4 files have their mime type detected as video/quicktime
Key: TIKA-3646
URL: https://issues.apache.org/jira/browse/TIKA-3646
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474650#comment-17474650
]
Tika User commented on TIKA-3642:
-
Tried using setMaxMainMemoryBytes; still seeing memory issues. The same
[
https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17472746#comment-17472746
]
Tika User commented on TIKA-3642:
-
Thanks for the update [~tallison] . I am doing some testing with this
[
https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17472284#comment-17472284
]
Tika User edited comment on TIKA-3642 at 1/11/22, 6:14 AM:
---
[~tallison]
[
https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3642:
Description:
When parsing large PDF files (1.65 GB), we are getting an out-of-memory error. The
version we are
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17472195#comment-17472195
]
Tika User commented on TIKA-3634:
-
Will the fix be available in the next release? For now I handled it in our
Tika User created TIKA-3642:
---
Summary: Getting java.lang.OutOfMemoryError: Java heap space when
parsing PDF file
Key: TIKA-3642
URL: https://issues.apache.org/jira/browse/TIKA-3642
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466374#comment-17466374
]
Tika User edited comment on TIKA-3634 at 12/29/21, 9:30 AM:
For 2.0.0 version
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466374#comment-17466374
]
Tika User commented on TIKA-3634:
-
For the 2.0.0 version, the .key files' MIME type is correct but failing to
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466053#comment-17466053
]
Tika User commented on TIKA-3634:
-
FYI, this is working fine in version 2.0.0. Able to get the correct file
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: mortgagecalculator.numbers
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: brochure.pages
> Failed to Parser Apple related files
>
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: keynotecreated.key
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: (was: Bug664940.zip)
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: Bug664940.zip
> Failed to Parser Apple related files
>
>
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: (was: keynotecreated.key)
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: (was: brochure.pages)
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: (was: mortgagecalculator.numbers)
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Fix Version/s: 1.27
(was: 2.2.0)
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: brochure.pages
> Failed to Parser Apple related files
>
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: keynotecreated.key
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User updated TIKA-3634:
Attachment: mortgagecalculator.numbers
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17465675#comment-17465675
]
Tika User edited comment on TIKA-3634 at 12/27/21, 10:35 AM:
-
.
was (Author:
[
https://issues.apache.org/jira/browse/TIKA-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17465675#comment-17465675
]
Tika User commented on TIKA-3634:
-
[~tallison]
> Failed to Parser Apple related files
>
[
https://issues.apache.org/jira/browse/TIKA-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tika User closed TIKA-3633.
---
Resolution: Fixed
> Failed to process Embedded Files
>
>
>
Tika User created TIKA-3634:
---
Summary: Failed to Parser Apple related files
Key: TIKA-3634
URL: https://issues.apache.org/jira/browse/TIKA-3634
Project: Tika
Issue Type: Bug
Components:
[
https://issues.apache.org/jira/browse/TIKA-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17464520#comment-17464520
]
Tika User commented on TIKA-3633:
-
Thanks for the update. Will it be resolved in that version?
> Failed to
86 matches