[jira] [Commented] (TIKA-3437) Deprecate experimental Preflight pdf parser in 1.x and remove it from 2.x

2021-06-03 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356710#comment-17356710
 ] 

Hudson commented on TIKA-3437:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #257 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/257/])
TIKA-3437 -- remove the preflight parser from 2.x (tallison: 
[https://github.com/apache/tika/commit/0a84892e33bb8ba69dcfd7d4928831c85af102a0])
* (delete) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/test/resources/org/apache/tika/parser/pdf/tika-preflight-config.xml
* (edit) CHANGES.txt
* (edit) tika-bundles/tika-bundle-standard/pom.xml
* (edit) tika-core/src/main/java/org/apache/tika/metadata/PDF.java
* (delete) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/test/java/org/apache/tika/parser/pdf/PDFPreflightParserTest.java
* (delete) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/src/main/java/org/apache/tika/parser/pdf/PDFPreflightParser.java
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/pom.xml


> Deprecate experimental Preflight pdf parser in 1.x and remove it from 2.x
> -
>
> Key: TIKA-3437
> URL: https://issues.apache.org/jira/browse/TIKA-3437
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Major
>
> The PDFBox project has voted to end support for the Preflight parser.  I 
> don't think it is used much.  I think we can safely deprecate it in 1.x and 
> remove it in 2.x.
> https://lists.apache.org/thread.html/rab5e988eda580b1b9fb1653b86e3eaf524cf72a946af027112eed0a1%40%3Cdev.pdfbox.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3437) Deprecate experimental Preflight pdf parser in 1.x and remove it from 2.x

2021-06-03 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356681#comment-17356681
 ] 

Hudson commented on TIKA-3437:
--

UNSTABLE: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #132 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-branch1x-jdk8/132/])
TIKA-3437 -- deprecate experimental PDFPreflightParser (tallison: 
[https://github.com/apache/tika/commit/6a9e72677439ce60b08ecbfb02dca7d710ef70d5])
* (edit) CHANGES.txt
* (edit) 
tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFPreflightParser.java


> Deprecate experimental Preflight pdf parser in 1.x and remove it from 2.x
> -
>
> Key: TIKA-3437
> URL: https://issues.apache.org/jira/browse/TIKA-3437
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Major
>
> The PDFBox project has voted to end support for the Preflight parser.  I 
> don't think it is used much.  I think we can safely deprecate it in 1.x and 
> remove it in 2.x.
> https://lists.apache.org/thread.html/rab5e988eda580b1b9fb1653b86e3eaf524cf72a946af027112eed0a1%40%3Cdev.pdfbox.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: new committer: Nicholas DiPiazza

2021-06-03 Thread Luís Filipe Nassif
Welcome on board, Nicholas. Great work!

Best regards,
Luis Filipe Nassif

Em qui., 3 de jun. de 2021 às 16:00, Nicholas DiPiazza <
nicholas.dipia...@gmail.com> escreveu:

> Hi Everyone!
>
> Happy to be one of the committers for Tika!
>
> My name is Nicholas DiPiazza - I reside in Madison, Wisconsin USA. My name
> is Sicilian in origin, and I look Italian... but I'm actually 50% Irish!
>
> I started doing Tika contributions through my work at Lucidworks (
> https://lucidworks.com) building connectors that grab content from various
> data sources such as SharePoint, Google Drive, OneDrive, Alfresco, etc.,
> parsing it using Apache Tika, and eventually indexing it into Solr.
>
> I primarily do back-end Java work but also do work in various languages and
> frameworks. Most recently I have been doing a lot in Scala and Spark.
>
> I have been having a lot of fun making Tika work at a massive scale inside
> Kube containers! I scraped together a homemade version of the Tika Pipes
> project in Tika 1.x to meet some needs I had, and then later collaborated
> with Tim Allison to get this into Tika 2.x. Super stoked to get this in a
> major version of Tika.
>
> Other stuff about me:
>
> I play drums for a metal band called Wake and Prevail
> https://www.reverbnation.com/wakeandprevail although the Covid situation
> has put music on hold indefinitely, I still jam to albums in my basement
> regularly.
>
> I play Starcraft 2 in my spare time, but am stuck in the Diamond League as
> I don't want to hurt my fingers/wrists getting my APM any higher.
>
> I prefer Ubuntu, Windows then Mac in that order. My Mac is actually in a
> box back from when I moved and I have managed not to need it for several
> months now.
>
>
> Looking forward to doing even more contributions throughout the next couple
> years, in particular improving our DWG support and improving the OneNote
> parsing. And hoping to create Tika Pipes tutorials hopefully to help get
> lots of people using that feature so we can get lots of contributions to
> improve it.
>
> Thanks!
> -Nichiolas
>
> On Thu, Jun 3, 2021 at 1:18 PM Tim Allison  wrote:
>
> > The Project Management Committee (PMC) for Apache Tika
> > has invited Nicholas DiPiazza to become a committer and we are pleased
> > to announce that he has accepted.
> >
> > Nicholas has made numerous contributions including the OneNoteParser,
> > and, more recently, the Solr pipes modules.  We look forward to continued
> > collaboration to make Tika more robust and scaleable.
> >
> > Being a committer enables easier contribution to the
> > project since there is no need to go via the patch
> > submission process. This should enable better productivity.
> > Being a PMC member enables assistance with the management
> > and to guide the direction of the project.
> >
> > Welcome aboard, Nicholas!  Please share a bit about yourself.
> >
> > Cheers,
> >
> >Tim
> >
>


Re: new committer: Nicholas DiPiazza

2021-06-03 Thread Nicholas DiPiazza
Hi Everyone!

Happy to be one of the committers for Tika!

My name is Nicholas DiPiazza - I reside in Madison, Wisconsin USA. My name
is Sicilian in origin, and I look Italian... but I'm actually 50% Irish!

I started doing Tika contributions through my work at Lucidworks (
https://lucidworks.com) building connectors that grab content from various
data sources such as SharePoint, Google Drive, OneDrive, Alfresco, etc.,
parsing it using Apache Tika, and eventually indexing it into Solr.

I primarily do back-end Java work but also do work in various languages and
frameworks. Most recently I have been doing a lot in Scala and Spark.

I have been having a lot of fun making Tika work at a massive scale inside
Kube containers! I scraped together a homemade version of the Tika Pipes
project in Tika 1.x to meet some needs I had, and then later collaborated
with Tim Allison to get this into Tika 2.x. Super stoked to get this in a
major version of Tika.

Other stuff about me:

I play drums for a metal band called Wake and Prevail
https://www.reverbnation.com/wakeandprevail although the Covid situation
has put music on hold indefinitely, I still jam to albums in my basement
regularly.

I play Starcraft 2 in my spare time, but am stuck in the Diamond League as
I don't want to hurt my fingers/wrists getting my APM any higher.

I prefer Ubuntu, Windows then Mac in that order. My Mac is actually in a
box back from when I moved and I have managed not to need it for several
months now.


Looking forward to doing even more contributions throughout the next couple
years, in particular improving our DWG support and improving the OneNote
parsing. And hoping to create Tika Pipes tutorials hopefully to help get
lots of people using that feature so we can get lots of contributions to
improve it.

Thanks!
-Nichiolas

On Thu, Jun 3, 2021 at 1:18 PM Tim Allison  wrote:

> The Project Management Committee (PMC) for Apache Tika
> has invited Nicholas DiPiazza to become a committer and we are pleased
> to announce that he has accepted.
>
> Nicholas has made numerous contributions including the OneNoteParser,
> and, more recently, the Solr pipes modules.  We look forward to continued
> collaboration to make Tika more robust and scaleable.
>
> Being a committer enables easier contribution to the
> project since there is no need to go via the patch
> submission process. This should enable better productivity.
> Being a PMC member enables assistance with the management
> and to guide the direction of the project.
>
> Welcome aboard, Nicholas!  Please share a bit about yourself.
>
> Cheers,
>
>Tim
>


[jira] [Commented] (TIKA-3436) Add multi-release for 2.x

2021-06-03 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356655#comment-17356655
 ] 

Hudson commented on TIKA-3436:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #256 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/256/])
TIKA-3436 (tallison: 
[https://github.com/apache/tika/commit/4a0a4903f2984d1e5a1c886e2b8bb2343c0c3ef0])
* (edit) tika-app/pom.xml
* (edit) tika-eval/tika-eval-app/pom.xml
* (edit) tika-server/tika-server-standard/pom.xml


> Add multi-release for 2.x
> -
>
> Key: TIKA-3436
> URL: https://issues.apache.org/jira/browse/TIKA-3436
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Major
> Fix For: 2.0.0
>
>
> After upgrading to log4j2, we're getting {{WARNING: 
> sun.reflect.Reflection.getCallerClass is not supported. This will impact 
> performance.}}.  This kills async and pipes because the warning is written to 
> stdout at least in the sub process. Adding multi release fixes this problem.
> {noformat}
>   
> true
>   
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TIKA-3437) Deprecate experimental Preflight pdf parser in 1.x and remove it from 2.x

2021-06-03 Thread Tim Allison (Jira)
Tim Allison created TIKA-3437:
-

 Summary: Deprecate experimental Preflight pdf parser in 1.x and 
remove it from 2.x
 Key: TIKA-3437
 URL: https://issues.apache.org/jira/browse/TIKA-3437
 Project: Tika
  Issue Type: Bug
Reporter: Tim Allison


The PDFBox project has voted to end support for the Preflight parser.  I don't 
think it is used much.  I think we can safely deprecate it in 1.x and remove it 
in 2.x.

https://lists.apache.org/thread.html/rab5e988eda580b1b9fb1653b86e3eaf524cf72a946af027112eed0a1%40%3Cdev.pdfbox.apache.org%3E



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


new committer: Nicholas DiPiazza

2021-06-03 Thread Tim Allison
The Project Management Committee (PMC) for Apache Tika
has invited Nicholas DiPiazza to become a committer and we are pleased
to announce that he has accepted.

Nicholas has made numerous contributions including the OneNoteParser,
and, more recently, the Solr pipes modules.  We look forward to continued
collaboration to make Tika more robust and scaleable.

Being a committer enables easier contribution to the
project since there is no need to go via the patch
submission process. This should enable better productivity.
Being a PMC member enables assistance with the management
and to guide the direction of the project.

Welcome aboard, Nicholas!  Please share a bit about yourself.

Cheers,

   Tim


[jira] [Updated] (TIKA-3435) Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x

2021-06-03 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-3435:
--
Priority: Minor  (was: Major)

> Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x
> --
>
> Key: TIKA-3435
> URL: https://issues.apache.org/jira/browse/TIKA-3435
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Assignee: Tim Allison
>Priority: Minor
> Fix For: 2.0.0
>
>
> For those using the legacy endpoints: /tika /rmeta, etc., they may want to 
> configure file system fetchers but no emitters.  We should not assume that 
> users will want fetchers _and_ emitters when setting "enableUnsecureFeatures".



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3435) Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x

2021-06-03 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356624#comment-17356624
 ] 

Hudson commented on TIKA-3435:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #255 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/255/])
TIKA-3435 -- allow fetchers only when enableUnsecureFeatures=true (tallison: 
[https://github.com/apache/tika/commit/54e6b8c96baa32919165a3bff735633acf827ea8])
* (edit) 
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/resource/AsyncResource.java
* (edit) 
tika-server/tika-server-core/src/test/java/org/apache/tika/server/core/TikaServerConfigTest.java
* (add) 
tika-server/tika-server-core/src/test/resources/configs/tika-config-server-fetchers-emitters.xml
* (edit) 
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/TikaServerProcess.java
* (edit) 
tika-server/tika-server-core/src/main/java/org/apache/tika/server/core/TikaServerConfig.java


> Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x
> --
>
> Key: TIKA-3435
> URL: https://issues.apache.org/jira/browse/TIKA-3435
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Assignee: Tim Allison
>Priority: Major
> Fix For: 2.0.0
>
>
> For those using the legacy endpoints: /tika /rmeta, etc., they may want to 
> configure file system fetchers but no emitters.  We should not assume that 
> users will want fetchers _and_ emitters when setting "enableUnsecureFeatures".



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3434) Document removal of urlenabledinputstream in 2.x

2021-06-03 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356600#comment-17356600
 ] 

Tim Allison commented on TIKA-3434:
---

To get that example to work, in the /tika endpoint, you'll need to rebuild/pull 
the latest with the recent fixes.  Sorry and thank you!

> Document removal of urlenabledinputstream in 2.x
> 
>
> Key: TIKA-3434
> URL: https://issues.apache.org/jira/browse/TIKA-3434
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Minor
>
> In 2.x, I removed the URLEnabledInputStream in favor of a file fetcher.  We 
> need to add this to breaking changes in the CHANGES.txt file and document the 
> change on our wiki.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TIKA-3434) Document removal of urlenabledinputstream in 2.x

2021-06-03 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3434.
---
Fix Version/s: 2.0.0
   Resolution: Fixed

> Document removal of urlenabledinputstream in 2.x
> 
>
> Key: TIKA-3434
> URL: https://issues.apache.org/jira/browse/TIKA-3434
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 2.0.0
>
>
> In 2.x, I removed the URLEnabledInputStream in favor of a file fetcher.  We 
> need to add this to breaking changes in the CHANGES.txt file and document the 
> change on our wiki.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TIKA-3436) Add multi-release for 2.x

2021-06-03 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3436.
---
Fix Version/s: 2.0.0
   Resolution: Fixed

> Add multi-release for 2.x
> -
>
> Key: TIKA-3436
> URL: https://issues.apache.org/jira/browse/TIKA-3436
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Major
> Fix For: 2.0.0
>
>
> After upgrading to log4j2, we're getting {{WARNING: 
> sun.reflect.Reflection.getCallerClass is not supported. This will impact 
> performance.}}.  This kills async and pipes because the warning is written to 
> stdout at least in the sub process. Adding multi release fixes this problem.
> {noformat}
>   
> true
>   
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TIKA-3435) Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x

2021-06-03 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3435.
---
Fix Version/s: 2.0.0
 Assignee: Tim Allison
   Resolution: Fixed

> Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x
> --
>
> Key: TIKA-3435
> URL: https://issues.apache.org/jira/browse/TIKA-3435
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Assignee: Tim Allison
>Priority: Major
> Fix For: 2.0.0
>
>
> For those using the legacy endpoints: /tika /rmeta, etc., they may want to 
> configure file system fetchers but no emitters.  We should not assume that 
> users will want fetchers _and_ emitters when setting "enableUnsecureFeatures".



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (TIKA-3432) Fix localization issue in OOXMLParserTest

2021-06-03 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3432.
---
Resolution: Fixed

Thank you [~tilman]!

> Fix localization issue in OOXMLParserTest
> -
>
> Key: TIKA-3432
> URL: https://issues.apache.org/jira/browse/TIKA-3432
> Project: Tika
>  Issue Type: Improvement
>  Components: parser
>Affects Versions: 2.0.0
>Reporter: Tim Allison
>Priority: Minor
>  Labels: localization
>
> [~tilman] reported this locale issue exception:
> {noformat}
> testXLSBVarious(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)
> Time elapsed: 0.153 sec  <<< FAILURE!
> java.lang.AssertionError: 13.1211231321 not found in:
> float 13,1211231321
> 
> I guess this is a localization problem. The test code itself is from
> 2017, so the problem is probably elsewhere.
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3434) Document removal of urlenabledinputstream in 2.x

2021-06-03 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356595#comment-17356595
 ] 

Tim Allison commented on TIKA-3434:
---

https://cwiki.apache.org/confluence/display/TIKA/tika-pipes#tikapipes-FetchersInClassicServerEndpoints

> Document removal of urlenabledinputstream in 2.x
> 
>
> Key: TIKA-3434
> URL: https://issues.apache.org/jira/browse/TIKA-3434
> Project: Tika
>  Issue Type: Bug
>Reporter: Tim Allison
>Priority: Minor
>
> In 2.x, I removed the URLEnabledInputStream in favor of a file fetcher.  We 
> need to add this to breaking changes in the CHANGES.txt file and document the 
> change on our wiki.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TIKA-3436) Add multi-release for 2.x

2021-06-03 Thread Tim Allison (Jira)
Tim Allison created TIKA-3436:
-

 Summary: Add multi-release for 2.x
 Key: TIKA-3436
 URL: https://issues.apache.org/jira/browse/TIKA-3436
 Project: Tika
  Issue Type: Bug
Reporter: Tim Allison


After upgrading to log4j2, we're getting {{WARNING: 
sun.reflect.Reflection.getCallerClass is not supported. This will impact 
performance.}}.  This kills async and pipes because the warning is written to 
stdout at least in the sub process. Adding multi release fixes this problem.

{noformat}
  
true
  
{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (TIKA-3432) Fix localization issue in OOXMLParserTest

2021-06-03 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356542#comment-17356542
 ] 

Tilman Hausherr commented on TIKA-3432:
---

Yes. I thought that was the branch for the 2.0 release.

You can close the issue. The test mentioned works in the 2.0.0 beta (another 
test fails, but doesn't in main, so it's probably been fixed).

> Fix localization issue in OOXMLParserTest
> -
>
> Key: TIKA-3432
> URL: https://issues.apache.org/jira/browse/TIKA-3432
> Project: Tika
>  Issue Type: Improvement
>  Components: parser
>Affects Versions: 2.0.0
>Reporter: Tim Allison
>Priority: Minor
>  Labels: localization
>
> [~tilman] reported this locale issue exception:
> {noformat}
> testXLSBVarious(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)
> Time elapsed: 0.153 sec  <<< FAILURE!
> java.lang.AssertionError: 13.1211231321 not found in:
> float 13,1211231321
> 
> I guess this is a localization problem. The test code itself is from
> 2017, so the problem is probably elsewhere.
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TIKA-3435) Allow fetchers only when enableUnsecureFeatures is true in tika-server 2.x

2021-06-03 Thread Tim Allison (Jira)
Tim Allison created TIKA-3435:
-

 Summary: Allow fetchers only when enableUnsecureFeatures is true 
in tika-server 2.x
 Key: TIKA-3435
 URL: https://issues.apache.org/jira/browse/TIKA-3435
 Project: Tika
  Issue Type: Bug
Reporter: Tim Allison


For those using the legacy endpoints: /tika /rmeta, etc., they may want to 
configure file system fetchers but no emitters.  We should not assume that 
users will want fetchers _and_ emitters when setting "enableUnsecureFeatures".



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (TIKA-3434) Document removal of urlenabledinputstream in 2.x

2021-06-03 Thread Tim Allison (Jira)
Tim Allison created TIKA-3434:
-

 Summary: Document removal of urlenabledinputstream in 2.x
 Key: TIKA-3434
 URL: https://issues.apache.org/jira/browse/TIKA-3434
 Project: Tika
  Issue Type: Bug
Reporter: Tim Allison


In 2.x, I removed the URLEnabledInputStream in favor of a file fetcher.  We 
need to add this to breaking changes in the CHANGES.txt file and document the 
change on our wiki.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (TIKA-3432) Fix localization issue in OOXMLParserTest

2021-06-03 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356323#comment-17356323
 ] 

Tim Allison edited comment on TIKA-3432 at 6/3/21, 10:30 AM:
-

Ha. Ok. {{main}} is the development branch for 2.x. 2.x is the original draft 
by Bob Paulin. Let’s rename that branch.

Or, did you get this failure on the 2.0.0-BETA tag/src?


was (Author: talli...@mitre.org):
Ha. Ok. {{main}} is the development branch for 2.x. 2.x is the original draft 
by Bob Paulin. Let’s rename that branch.

> Fix localization issue in OOXMLParserTest
> -
>
> Key: TIKA-3432
> URL: https://issues.apache.org/jira/browse/TIKA-3432
> Project: Tika
>  Issue Type: Improvement
>  Components: parser
>Affects Versions: 2.0.0
>Reporter: Tim Allison
>Priority: Minor
>  Labels: localization
>
> [~tilman] reported this locale issue exception:
> {noformat}
> testXLSBVarious(org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest)
> Time elapsed: 0.153 sec  <<< FAILURE!
> java.lang.AssertionError: 13.1211231321 not found in:
> float 13,1211231321
> 
> I guess this is a localization problem. The test code itself is from
> 2017, so the problem is probably elsewhere.
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)