Here's my +1
On 1 15 2021, at 2:44, Tilman Hausherr wrote:
> +1
>
> Tilman
> On 14.01.2021 at 02:19, Tim Allison wrote:
> > All,
> >
> > A candidate for the Tika 2.0.0-ALPHA release is available at:
> > https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive
[
https://issues.apache.org/jira/browse/TIKA-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17252556#comment-17252556
]
Peter Lee commented on TIKA-3180:
-
It works now. :)
> Tika 2.0.0 -- Modularize tika-ser
[
https://issues.apache.org/jira/browse/TIKA-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251612#comment-17251612
]
Peter Lee commented on TIKA-3180:
-
It seems some tests have failed; see
[https://ci-builds.apache.org/job
> That one (0810700) I wanted to commit.
>
I see. Everything looks good now. :)
Lee
On 12 14 2020, at 4:35, Tilman Hausherr wrote:
> On 14.12.2020 at 08:48, Peter Lee wrote:
> > Seems the latest commit 7f65d61 is exactly the same as dd85c73:
> > https://github.com/apache
meone please verify this:
> the last good commit is from Peter Lee "Simplify init code of some Set
> and List".
> then I made a small commit "TIKA-3248: avoid ClassCastException" of
> about 10 lines.
>
> then "bad" things happened.
> Ideally
[
https://issues.apache.org/jira/browse/TIKA-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Peter Lee resolved TIKA-3218.
-
Fix Version/s: 2.0.0
Resolution: Fixed
> Wrong comment for method sortLoadedClas
[
https://issues.apache.org/jira/browse/TIKA-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244377#comment-17244377
]
Peter Lee commented on TIKA-3218:
-
Thank you for fixing this (y)
> Wrong comment for met
Many thanks to you, Tim. :)
Hi, all
I'm Peter Lee and I was an Apache Commons committer. I'm familiar with many
archivers and compressors. Feel free to ask me if you have any problems with
compression.
I'm honored to be part of Tika. Tika is great and it helped me a lot. Besides,
Tika is a great
Got the same problem.
After some investigation I believe it's caused by the version of
maven-bundle-plugin:
I can successfully build branch_1x with version 4.1.0, but the build fails with
versions 4.2.0, 4.2.1 and 5.1.1.
Still working on finding out what's wrong here. Hope this helps.
cheers,
Lee
On 11
[
https://issues.apache.org/jira/browse/TIKA-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17227108#comment-17227108
]
Peter Lee commented on TIKA-3218:
-
_so that user-provided ones would come first and would be able
Peter Lee created TIKA-3218:
---
Summary: Wrong comment for method sortLoadedClasses in
ServiceLoaderUtils
Key: TIKA-3218
URL: https://issues.apache.org/jira/browse/TIKA-3218
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17220030#comment-17220030
]
Peter Lee commented on TIKA-3213:
-
This forked repository hasn't supported Chinese charset detection since version
[
https://issues.apache.org/jira/browse/TIKA-3209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217212#comment-17217212
]
Peter Lee commented on TIKA-3209:
-
Hi [~nick]
Just replace PicturesSource in Tika with PictureRunMapper
[
https://issues.apache.org/jira/browse/TIKA-3209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216391#comment-17216391
]
Peter Lee commented on TIKA-3209:
-
[~nick]
Could you give some advice?
Can we remove that line in POI
Peter Lee created TIKA-3209:
---
Summary: Different between PictureRunMapper in POI and
PicturesSource in Tika
Key: TIKA-3209
URL: https://issues.apache.org/jira/browse/TIKA-3209
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-3196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17200448#comment-17200448
]
Peter Lee edited comment on TIKA-3196 at 9/23/20, 2:13 AM:
---
Hi [~tallison]
I
[
https://issues.apache.org/jira/browse/TIKA-3196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17200448#comment-17200448
]
Peter Lee commented on TIKA-3196:
-
Hi [~tallison]
I wrote a test here :
[https://github.com/apache/tika
[
https://issues.apache.org/jira/browse/TIKA-3197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Peter Lee resolved TIKA-3197.
-
Resolution: Not A Problem
> TikaInputStream may not be clo
Peter Lee created TIKA-3197:
---
Summary: TikaInputStream may not be closed
Key: TIKA-3197
URL: https://issues.apache.org/jira/browse/TIKA-3197
Project: Tika
Issue Type: Bug
Components
Anything else?
>
> On Tue, Sep 8, 2020 at 9:56 PM Peter Lee wrote:
> > Hi Tim,
> >
> > I pushed some bugfix PRs on GitHub; maybe we could check whether they
> > should be merged into branch_1x :
> > #330 : URLs update
> > #340 : some minor fix T
Hi Tim,
I pushed some bugfix PRs on GitHub; maybe we could check whether they
should be merged into branch_1x :
#330 : URLs update
#340 : some minor fixes for TikaCLI
#347 : minor fix for BatchProcessBuilder
#353 : fix for test failures for developers whose default language is not
English
th GeoParser and SentimentAnalysisParser on
> the main branch. Removing the Logger fixes both and it builds cleanly. Still
> not sure what the exact issue is but I can recreate the issue and your
> solution.
> - Bob
> On 8/24/2020 4:02 AM, Peter Lee wrote:
> >
> > Update :
>
Update :
It works after I removed the loggers in GeoParser and GeoParserConfig. But I'm
still not clear what exactly the problem is. :(
Lee
On 8 24 2020, at 3:27 , Peter Lee wrote:
> Hi all,
>
> The tests are failing on my Windows machine: GeoParserTest fails because the
Hi all,
The tests are failing on my Windows machine: GeoParserTest fails because the
class org.apache.tika.parser.geo.GeoParser could not be found. But everything
works fine on my Ubuntu machine.
The error is weird. I did some googling but couldn't figure out what the
problem is.
Anyone who got same
Hi Tilman,
> expected: but was: charset=[windows-1252]>
I think this problem is caused by the charset detection strategy depending on
the line separator (CRLF or LF) together with the git autocrlf config. I also
ran into this problem and solved it like this:
Set autocrlf to false with git config --global core.autocrlf
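A minimal sketch of the setting described above, assuming git is installed. It uses a throwaway repository and a repository-local setting instead of --global so that it does not change anyone's real configuration; the variable names repo and autocrlf_value are hypothetical and only for illustration.

```shell
# Create a scratch repository so the sketch does not touch real settings.
repo="$(mktemp -d)"
git init --quiet "$repo"

# Repository-local equivalent of the --global setting mentioned above:
# turn off automatic CRLF/LF conversion for this repository.
git -C "$repo" config core.autocrlf false

# Read the value back to confirm what git will use for this repository.
autocrlf_value="$(git -C "$repo" config --get core.autocrlf)"
echo "core.autocrlf=$autocrlf_value"
```

Using --global in place of the repository-local form applies the same value to every repository of the current user. Files that are already checked out keep their existing line endings until they are checked out again.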
[
https://issues.apache.org/jira/browse/TIKA-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178184#comment-17178184
]
Peter Lee commented on TIKA-1770:
-
Tested the 3 given files in tika-1.24.1. Here is Tika's content-type detection
[
https://issues.apache.org/jira/browse/TIKA-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176206#comment-17176206
]
Peter Lee commented on TIKA-3155:
-
As I understand it, here is how Tika handles CSV files:
1
[
https://issues.apache.org/jira/browse/TIKA-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17175290#comment-17175290
]
Peter Lee commented on TIKA-3155:
-
We can do it in _TextAndCSVParser_ like this
{code:java}
CSVFormat
[
https://issues.apache.org/jira/browse/TIKA-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17175286#comment-17175286
]
Peter Lee commented on TIKA-3155:
-
Hey. I think it's caused by the Quote Mode of Apache Commons CSV. We
Hi all,
I've been working on TIKA-3141 recently and pushed a PR on GitHub. As Keith
suggested in the PR, maybe we should add Commons Lang to tika-core, as it seems
Commons Lang is being used elsewhere in Tika but not in tika-core.
Ideas?
cheers,
Lee
Hi all,
I've been using Tika recently and find it fascinating!
I pushed some PRs on GitHub but it seems no one is reviewing them (the same goes
for some other PRs on GitHub). Maybe somebody could give me a hand?
Here are the PRs:
https://github.com/apache/tika/pull/334
[
https://issues.apache.org/jira/browse/TIKA-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17167845#comment-17167845
]
Peter Lee commented on TIKA-3141:
-
Hi [~nick], I've been working on Tika recently and I'm interested