[
https://issues.apache.org/jira/browse/TIKA-4220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850804#comment-17850804
]
Hudson commented on TIKA-4220:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1642 (See
[
https://issues.apache.org/jira/browse/TIKA-4265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850776#comment-17850776
]
Tim Allison commented on TIKA-4265:
---
It doesn't help at all if there's a modification in tika-core, even
[
https://issues.apache.org/jira/browse/TIKA-4265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850773#comment-17850773
]
Tim Allison commented on TIKA-4265:
---
I just pushed a demo to {{build-cache}}. This includes
Tim Allison created TIKA-4265:
-
Summary: Consider adding maven build cache extension
Key: TIKA-4265
URL: https://issues.apache.org/jira/browse/TIKA-4265
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850756#comment-17850756
]
ASF GitHub Bot commented on TIKA-4220:
--
tballison merged PR #1790:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850754#comment-17850754
]
Hudson commented on TIKA-4221:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1641 (See
[
https://issues.apache.org/jira/browse/TIKA-4229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850708#comment-17850708
]
ASF GitHub Bot commented on TIKA-4229:
--
bartek commented on code in PR #1698:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850706#comment-17850706
]
ASF GitHub Bot commented on TIKA-4252:
--
tballison commented on PR #1778:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850703#comment-17850703
]
ASF GitHub Bot commented on TIKA-4220:
--
tballison opened a new pull request, #1790:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850704#comment-17850704
]
ASF GitHub Bot commented on TIKA-4221:
--
tballison merged PR #1789:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17850691#comment-17850691
]
ASF GitHub Bot commented on TIKA-4221:
--
tballison opened a new pull request, #1789:
URL: https
Nicholas DiPiazza created TIKA-4264:
---
Summary: Tika Pipes - Structured output (XHTML) support?
Key: TIKA-4264
URL: https://issues.apache.org/jira/browse/TIKA-4264
Project: Tika
Issue Type
Andres Almiray created TIKA-4263:
Summary: Provide full Java module descriptors
Key: TIKA-4263
URL: https://issues.apache.org/jira/browse/TIKA-4263
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-4263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andres Almiray updated TIKA-4263:
-
Description:
`v3.0.0-BETA` defines an automatic module name
```
$ jarviz module name --gav
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849561#comment-17849561
]
ASF GitHub Bot commented on TIKA-4252:
--
nddipiazza opened a new pull request, #1778:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849560#comment-17849560
]
ASF GitHub Bot commented on TIKA-4252:
--
nddipiazza closed pull request #1774: TIKA-4252 fetch tuple
[
https://issues.apache.org/jira/browse/TIKA-4262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas DiPiazza closed TIKA-4262.
---
Assignee: Nicholas DiPiazza
Resolution: Invalid
never mind - this was an issue in my
[
https://issues.apache.org/jira/browse/TIKA-4262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nicholas DiPiazza updated TIKA-4262:
Description:
tika configuration when saving a fetcher with a list of strings will look like
Nicholas DiPiazza created TIKA-4262:
---
Summary: In pipes XML config, List serializes incorrect
causing the parameters to be empty when read
Key: TIKA-4262
URL: https://issues.apache.org/jira/browse/TIKA-4262
[
https://issues.apache.org/jira/browse/TIKA-4261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849394#comment-17849394
]
Hudson commented on TIKA-4261:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1638 (See
[
https://issues.apache.org/jira/browse/TIKA-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849384#comment-17849384
]
ASF GitHub Bot commented on TIKA-4260:
--
tballison commented on PR #1776:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849379#comment-17849379
]
ASF GitHub Bot commented on TIKA-4261:
--
tballison merged PR #1777:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849369#comment-17849369
]
ASF GitHub Bot commented on TIKA-4261:
--
tballison opened a new pull request, #1777:
URL: https
Tim Allison created TIKA-4261:
-
Summary: Add attachment type metadata filter
Key: TIKA-4261
URL: https://issues.apache.org/jira/browse/TIKA-4261
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849321#comment-17849321
]
Hudson commented on TIKA-4259:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1637 (See
[
https://issues.apache.org/jira/browse/TIKA-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-4259.
---
Fix Version/s: 3.0.0
Resolution: Fixed
> Decouple xml parser stuff from ParseCont
[
https://issues.apache.org/jira/browse/TIKA-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849299#comment-17849299
]
ASF GitHub Bot commented on TIKA-4259:
--
tballison merged PR #1775:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849298#comment-17849298
]
Tim Allison commented on TIKA-4260:
---
That PR currently only works on tika-core. More needs to be done
[
https://issues.apache.org/jira/browse/TIKA-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849296#comment-17849296
]
ASF GitHub Bot commented on TIKA-4260:
--
tballison opened a new pull request, #1776:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849297#comment-17849297
]
ASF GitHub Bot commented on TIKA-4260:
--
tballison commented on PR #1776:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849288#comment-17849288
]
Tim Allison commented on TIKA-4243:
---
[~ndipiazza], I added parseContext to fetchers and emitters
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849103#comment-17849103
]
Tim Allison edited comment on TIKA-4243 at 5/24/24 1:00 PM:
Proposed basic
Tim Allison created TIKA-4260:
-
Summary: Add parse context to the fetcher interface in 3.x
Key: TIKA-4260
URL: https://issues.apache.org/jira/browse/TIKA-4260
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849117#comment-17849117
]
ASF GitHub Bot commented on TIKA-4259:
--
tballison opened a new pull request, #1775:
URL: https
Tim Allison created TIKA-4259:
-
Summary: Decouple xml parser stuff from ParseContext
Key: TIKA-4259
URL: https://issues.apache.org/jira/browse/TIKA-4259
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849114#comment-17849114
]
Tim Allison commented on TIKA-4243:
---
I'm going to start working on PRs that will be generally helpful
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849108#comment-17849108
]
Tim Allison commented on TIKA-4243:
---
The downsides we see:
a) if we there's agreement to add jackson
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849103#comment-17849103
]
Tim Allison commented on TIKA-4243:
---
Proposed basic roadmap:
Serialize ParseContext as is...
Allow
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17849101#comment-17849101
]
Tim Allison commented on TIKA-4243:
---
Fellow devs, in chatting with Nicholas, we're thinking
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848960#comment-17848960
]
Nicholas DiPiazza commented on TIKA-4243:
-
Sure that sounds good. When we chat later today
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848959#comment-17848959
]
ASF GitHub Bot commented on TIKA-4252:
--
nddipiazza commented on PR #1774:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848808#comment-17848808
]
ASF GitHub Bot commented on TIKA-4252:
--
nddipiazza opened a new pull request, #1774:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-4258.
---
Resolution: Fixed
Just pushed 2.9.2.1/*-latest
Thank you, all!
> Multi-arch support for doc
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848341#comment-17848341
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison closed pull request #19: Add Github CI workflows
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848342#comment-17848342
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848338#comment-17848338
]
Hudson commented on TIKA-4166:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1636 (See
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17848087#comment-17848087
]
ASF GitHub Bot commented on TIKA-4258:
--
nextgens commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847996#comment-17847996
]
Hudson commented on TIKA-4257:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1635 (See
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847981#comment-17847981
]
ASF GitHub Bot commented on TIKA-4257:
--
tballison merged PR #1773:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847980#comment-17847980
]
Tim Allison commented on TIKA-4255:
---
Thank you for opening this PR. Are you able to add a small unit
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-4256.
---
Fix Version/s: 3.0.0
Resolution: Fixed
> Allow inlining of ocr'd text in container docum
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847972#comment-17847972
]
ASF GitHub Bot commented on TIKA-4257:
--
tballison opened a new pull request, #1773:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847950#comment-17847950
]
Tim Allison commented on TIKA-4258:
---
I'm sure I'll need to modify the PR when I actually go to run
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847949#comment-17847949
]
Tim Allison commented on TIKA-4258:
---
Let's give it a day for fellow devs to weigh
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847947#comment-17847947
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847945#comment-17847945
]
ASF GitHub Bot commented on TIKA-4258:
--
hegerdes commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847943#comment-17847943
]
Tim Allison commented on TIKA-4258:
---
And here's the full version:
https://hub.docker.com/layers/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847937#comment-17847937
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847931#comment-17847931
]
Tim Allison commented on TIKA-4243:
---
Separately, but related to this and also to TIKA-4252 -- should we
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847929#comment-17847929
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847909#comment-17847909
]
Hudson commented on TIKA-4256:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1634 (See
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847905#comment-17847905
]
ASF GitHub Bot commented on TIKA-4258:
--
fpiesche commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847896#comment-17847896
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847895#comment-17847895
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847890#comment-17847890
]
Hudson commented on TIKA-4166:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1633 (See
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847887#comment-17847887
]
ASF GitHub Bot commented on TIKA-4258:
--
nextgens commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847884#comment-17847884
]
ASF GitHub Bot commented on TIKA-4258:
--
tballison commented on PR #19:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847883#comment-17847883
]
Tim Allison commented on TIKA-4258:
---
Helpful links from #infra:
https://infra.apache.org/docker-hub
[
https://issues.apache.org/jira/browse/TIKA-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847882#comment-17847882
]
Tim Allison commented on TIKA-4258:
---
If fellow devs with better knowledge of github actions and docker
Tim Allison created TIKA-4258:
-
Summary: Multi-arch support for docker images
Key: TIKA-4258
URL: https://issues.apache.org/jira/browse/TIKA-4258
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Description:
Tika detect method sometimes recognizes p7m files as format application/x-dbf
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847874#comment-17847874
]
ASF GitHub Bot commented on TIKA-4256:
--
tballison merged PR #1762:
URL: https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847868#comment-17847868
]
Hudson commented on TIKA-4166:
--
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk11 #1632 (See
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847856#comment-17847856
]
Hudson commented on TIKA-4166:
--
UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk11 #1631 (See
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847827#comment-17847827
]
Hudson commented on TIKA-4166:
--
ABORTED: Integrated in Jenkins build Tika » tika-main-jdk11 #1630 (See
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Description:
Tika detect method sometimes recognizes p7m files as format x-dbf
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Description:
Tika detect method sometimes recognizes p7m files as format x-dbf
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Summary: Tika detect() recognizes some p7m files as format x-dbf (was:
Tika detect
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Summary: Tika detect() riconosce alcuni file p7m come formato x-dbf (was:
Tika detect
[
https://issues.apache.org/jira/browse/TIKA-4257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Bentivoglio updated TIKA-4257:
---
Summary: Tika detect riconosce alcuni file p7m come formato x-dbf (was:
Riconoscimento file
Luca Bentivoglio created TIKA-4257:
--
Summary: Riconoscimento file p7m
Key: TIKA-4257
URL: https://issues.apache.org/jira/browse/TIKA-4257
Project: Tika
Issue Type: Bug
Components
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847335#comment-17847335
]
ASF GitHub Bot commented on TIKA-4256:
--
tballison opened a new pull request, #1762:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847018#comment-17847018
]
Alexey Pismenskiy commented on TIKA-696:
Hey [~nick] , we would be interested in this - any updates
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-4256:
--
Description:
For legacy tika, we're inlining all content from embedded files including ocr
content
[
https://issues.apache.org/jira/browse/TIKA-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-4256:
--
Description:
For legacy tika, we're inlining all content from embedded files including ocr
content
Tim Allison created TIKA-4256:
-
Summary: Allow inlining of ocr'd text in container document
Key: TIKA-4256
URL: https://issues.apache.org/jira/browse/TIKA-4256
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846908#comment-17846908
]
ASF GitHub Bot commented on TIKA-4255:
--
axeld opened a new pull request, #1761:
URL: https
Axel Dörfler created TIKA-4255:
--
Summary: TextAndCSVParser ignores Metadata.CONTENT_ENCODING
Key: TIKA-4255
URL: https://issues.apache.org/jira/browse/TIKA-4255
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-1907:
--
Fix Version/s: 3.0.0
> Big Pdf parsing to text - Out of mem
[
https://issues.apache.org/jira/browse/TIKA-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846697#comment-17846697
]
Tim Allison commented on TIKA-4137:
---
Y, done just now.
> Building current Tika main branch fails un
[
https://issues.apache.org/jira/browse/TIKA-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-4137:
--
Fix Version/s: 2.9.3
> Building current Tika main branch fails under Java 20
[
https://issues.apache.org/jira/browse/TIKA-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846672#comment-17846672
]
Roberto Franchini commented on TIKA-4137:
-
Could you please backport this small fix on 2.9.x
[
https://issues.apache.org/jira/browse/TIKA-4170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846142#comment-17846142
]
Tika User commented on TIKA-4170:
-
Any update on this ?
> Tika to extract Apple Key fi
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845590#comment-17845590
]
Tilman Hausherr edited comment on TIKA-4254 at 5/12/24 9:40 AM:
THausherr
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845649#comment-17845649
]
ASF GitHub Bot commented on TIKA-4254:
--
kaiyaok2 commented on PR #1754:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845623#comment-17845623
]
ASF GitHub Bot commented on TIKA-4252:
--
nddipiazza commented on code in PR #1753:
URL: https
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845595#comment-17845595
]
ASF GitHub Bot commented on TIKA-4254:
--
kaiyaok2 commented on PR #1754:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845590#comment-17845590
]
ASF GitHub Bot commented on TIKA-4254:
--
THausherr commented on PR #1754:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845586#comment-17845586
]
ASF GitHub Bot commented on TIKA-4254:
--
kaiyaok2 commented on PR #1754:
URL: https://github.com
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kaiyao Ke updated TIKA-4254:
Description:
### Brief Description of the Bug
The test `TestMimeTypes#testJavaRegex` is non-idempotent
1 - 100 of 31043 matches
Mail list logo