[
https://issues.apache.org/jira/browse/TIKA-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167123#comment-15167123
]
Nick Burch commented on TIKA-1855:
----------------------------------
Currently, we have most test documents in Tika Parsers, and a handful in Tika
Core, which is sometimes confusing. We also end up with quite a lot of the unit
tests for Tika Core actually being in the Tika Parsers test area, so that they
can use the test documents in parsers which aren't in core. Based on my
experience with this (e.g. where I start putting things in the wrong module,
initially can't find the right unit test, etc.), I find it non-ideal, and I
suspect it's not intuitive at all for new contributors.
For the Ogg Vorbis stuff I maintain, I've opted to put all of the test files
needed in {{core/src/test/resources}} and then have the other Maven modules
(e.g. the Tika one and the Tools one) depend on the core-test artifact as a
test-scope dependency, so that their unit tests can access the common set of
test files. I find this actually works quite well now that I have it set up,
and it seems OK for both InputStream-based and File-based tests.
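
As a rough sketch of what such a setup can look like in Maven: one common way
to share {{src/test/resources}} across modules is the test-jar mechanism of
the maven-jar-plugin. That the setup described above uses exactly this
mechanism is an assumption, and the coordinates {{org.example}} /
{{example-core}} are hypothetical placeholders.

```xml
<!-- In the core module's pom.xml: additionally package the test classes
     and test resources (src/test/...) as a "tests" jar. -->
<build>
  <plugins>
    <plugin>
      <groupId>org.apache.maven.plugins</groupId>
      <artifactId>maven-jar-plugin</artifactId>
      <executions>
        <execution>
          <goals>
            <goal>test-jar</goal>
          </goals>
        </execution>
      </executions>
    </plugin>
  </plugins>
</build>

<!-- In each module whose unit tests need the shared test files.
     groupId and artifactId are hypothetical placeholders. -->
<dependency>
  <groupId>org.example</groupId>
  <artifactId>example-core</artifactId>
  <version>${project.version}</version>
  <type>test-jar</type>
  <scope>test</scope>
</dependency>
```

With a dependency like this, the shared files are on the test classpath only,
so they do not end up in the module's main artifact.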
So, given the above two, I would suggest that we put all of our test documents
from core, parsers, server and bundle (all of which seem to have their own ones
at the moment!) into a single artifact. We then depend on that artifact for all
of our tests, with test scope.
> Tika 2.0 - Move shared test-code back to tika-core and distribute test files
> to parser modules
> ----------------------------------------------------------------------------------------------
>
> Key: TIKA-1855
> URL: https://issues.apache.org/jira/browse/TIKA-1855
> Project: Tika
> Issue Type: Sub-task
> Reporter: Tim Allison
> Assignee: Tim Allison
>
> Undo TIKA-1851, and divide test docs to appropriate parser modules.