The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #733)
Status: Failure
Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/733/ to
view the results.
This was due to the SVN issues that infra was dealing
with last night.
I’ll go ahead and spin RC #2 shortly.
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion
Hi Nick,
I've been mulling this over since you sent the first message. But, I'm
afraid I don't have a good solution or developed ideas.
I agree, it would be very nice to consolidate all configuration for all
parsers in the server and app.
Is it feasible to put everything into tika-config? Then
On Sat, 6 Jun 2015, Tyler Palsulich wrote:
(Devil's advocate hat slightly on.) My one hesitation about putting it
all into tika-config is that the default might get to be a monstrosity
-- difficult for new users to use.
Assuming you don't want any translators, and have no non-standard paths
[
https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575993#comment-14575993
]
Chris A. Mattmann commented on TIKA-1652:
-
+1, agreed. I'll wrap them both up
[
https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575986#comment-14575986
]
Tyler Palsulich commented on TIKA-1652:
---
I think this is a duplicate of TIKA-1426?
Hey Tyler,
I hear you, but balance that against all the hidden things here
and there, and everywhere, that I constantly keep discovering and
having to pore through lines of TikaConfig - service loaders, class
loaders.
When things work right - no problem. When something goes wrong;
HUGE waste of
[
https://issues.apache.org/jira/browse/TIKA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575996#comment-14575996
]
Chris A. Mattmann commented on TIKA-1645:
-
I got this working with both tika-app
[
https://issues.apache.org/jira/browse/TIKA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann reassigned TIKA-1645:
---
Assignee: Chris A. Mattmann (was: Giuseppe Totaro)
Extraction of biomedical
[
https://issues.apache.org/jira/browse/TIKA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1645.
-
Resolution: Fixed
Fix Version/s: 1.9 (was: 1.10)
[
https://issues.apache.org/jira/browse/TIKA-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1642.
-
Resolution: Fixed
Fix Version/s: 1.9
Assignee: Chris A. Mattmann (was:
[
https://issues.apache.org/jira/browse/TIKA-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14576051#comment-14576051
]
Hudson commented on TIKA-1642:
--
ABORTED: Integrated in tika-trunk-jdk1.7 #734 (See
[
https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14576053#comment-14576053
]
Hudson commented on TIKA-1652:
--
ABORTED: Integrated in tika-trunk-jdk1.7 #734 (See
[
https://issues.apache.org/jira/browse/TIKA-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14576052#comment-14576052
]
Hudson commented on TIKA-1426:
--
ABORTED: Integrated in tika-trunk-jdk1.7 #734 (See
[
https://issues.apache.org/jira/browse/TIKA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14576054#comment-14576054
]
Hudson commented on TIKA-1645:
--
ABORTED: Integrated in tika-trunk-jdk1.7 #734 (See
(Devil's advocate hat slightly on.) My one hesitation about putting it all
into tika-config is that the default might get to be a monstrosity --
difficult for new users to use.
Tyler
On Sat, Jun 6, 2015 at 3:48 PM Mattmann, Chris A (3980)
chris.a.mattm...@jpl.nasa.gov wrote:
I think it would
[
https://issues.apache.org/jira/browse/TIKA-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1426.
-
Resolution: Fixed
Fix Version/s: 1.9 (was: 1.10)
Hey Chris,
On 1 Jun 2015, at 06:38, Mattmann, Chris A (3980)
chris.a.mattm...@jpl.nasa.gov wrote:
Please vote on releasing this package as Apache Tika 1.9.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this
Also the lovely thing here too is that since cTAKESParser is a
decorator for AutoDetectParser there is magical infinite recursion
if it’s enabled via SPI.
TODO: make this a LOT cleaner in 1.10+.
++
Chris Mattmann, Ph.D.
Chief
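The recursion described in the message above can be illustrated with a small, self-contained sketch. It does not use Tika's actual API: `Parser`, `AutoDetect`, and `Decorator` are hypothetical stand-ins for the roles played by AutoDetectParser and cTAKESParser, and the `MAX_DEPTH` guard is an assumption added only so the demo terminates. The idea is that a decorator which delegates to an auto-detecting parser will call itself again if it is also one of the parsers that the auto-detecting parser discovers.

```java
import java.util.ArrayList;
import java.util.List;

public class DecoratorRecursionDemo {
    // Hypothetical minimal parser interface; returns the deepest call depth reached.
    interface Parser {
        int parse(String doc, int depth);
    }

    // Guard so the demo stops; without it the registered case never returns.
    static final int MAX_DEPTH = 5;

    // Stand-in for an auto-detecting parser: delegates to every registered parser.
    static class AutoDetect implements Parser {
        final List<Parser> registered = new ArrayList<>();

        @Override
        public int parse(String doc, int depth) {
            int deepest = depth;
            for (Parser p : registered) {
                deepest = Math.max(deepest, p.parse(doc, depth + 1));
            }
            return deepest;
        }
    }

    // Stand-in for a decorator that wraps the auto-detecting parser.
    static class Decorator implements Parser {
        final Parser wrapped;

        Decorator(Parser wrapped) {
            this.wrapped = wrapped;
        }

        @Override
        public int parse(String doc, int depth) {
            if (depth >= MAX_DEPTH) {
                return depth; // demo-only guard against unbounded recursion
            }
            return wrapped.parse(doc, depth + 1);
        }
    }

    // Builds the parser graph and reports how deep one parse call goes.
    static int maxDepth(boolean registerDecorator) {
        AutoDetect auto = new AutoDetect();
        auto.registered.add((doc, depth) -> depth); // an ordinary leaf parser
        Decorator decorator = new Decorator(auto);
        if (registerDecorator) {
            // Models the decorator also being discovered by the wrapped parser.
            auto.registered.add(decorator);
        }
        return decorator.parse("doc", 0);
    }

    public static void main(String[] args) {
        System.out.println("decorator not registered: depth " + maxDepth(false));
        System.out.println("decorator registered:     depth " + maxDepth(true));
    }
}
```

In this model the call depth stays small when the decorator is not registered with the parser it wraps, and grows until the guard trips when it is. One possible way out, under the same assumptions, is to make sure the wrapped parser never includes the decorator in its own delegate list.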
I think it would be great to have all this in the Tika Config.
The one thing then is to provide an example default config and
to make it *hugely* clear rather than all the levels of indirection
that we currently have going on which makes it super hard when
there is a config error (SPI, swallowing
[
https://issues.apache.org/jira/browse/TIKA-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14575997#comment-14575997
]
Chris A. Mattmann commented on TIKA-1645:
-
Documentation:
[
https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1652.
-
Resolution: Fixed
- Fixed:
{noformat}
bash-3.2$ svn commit -m Fix for TIKA-1652,
Hi Folks,
A second candidate for the Tika 1.9 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
http://svn.apache.org/repos/asf/tika/tags/1.9-rc2/
The SHA1 checksum of the archive is
Chris A. Mattmann created TIKA-1652:
---
Summary: Tika Server should allow config file override from the
command line like Tika App
Key: TIKA-1652
URL: https://issues.apache.org/jira/browse/TIKA-1652
Anyone have any thoughts on this?
On Fri, 8 May 2015, Nick Burch wrote:
Hi All
This came up in TIKA-1623, but I thought it might be better brought out to
the list for discussion
To configure parsers on a per-document basis, such as setting PDF
spacing tolerances, or telling Tesseract what