[ 
https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15085946#comment-15085946
 ] 

Tim Allison edited comment on TIKA-1824 at 1/6/16 6:08 PM:
-----------------------------------------------------------

[~bobpaulin], this is an awesome step forward.  Must have been a fair amount of 
work. Thank you!

Few questions...not just for you, but for all.  I'm happy to submit/commit 
patches, but I want to make sure I don't do anything objectionable to the 
community

* This is probably user error, but I'm getting: \[ERROR\] Failed to execute 
goal org.apache.maven.plugins:maven-dependency-plugin:2.10:unpack (unpack) on 
project tika-text-module: Unable to find artifact. Could not find artifact 
org.apache.tika:tika-test-resources:jar:tests:2.0-SNAPSHOT in apache.snapshots 
(http://repository.apache.org/snapshots)
* Perhaps rename artifact names in parser sub-components to include 
"Parser(s?)", e.g. Apache Tika Parser Advanced Module so that the names sort 
more clearly (at least in the maven window in Intellij)?
* Perhaps add "parser(s?) to the artifactId, e.g. tika-parser-cad-module
* Perhaps lowercase names in parser-subcomponents so that they're inline with 
legacy: "Apache Tika parser advanced module"
* Pkcs7Parser ... should that be under advanced...or somewhere else ...own 
crypto package?
* iwork ...should we move that to office?
* tika-test-resources...should we move TikaTest into that and change the name 
to tika-test?  I have a vague memory of wanting to carve out a separate test 
package earlier and adding TikaTest and something else...
* OutlookPSTParser...move that to office?  
* Does MBox belong in web?  Not sure where to put it?
* Move CommonsDigester to core _if_ we're willing to add a dependency on 
commons-codec into core?
* Move Activator to tika-bundle?
* Move pot to multimedia or add tika-parsers-multimedia-advanced-module?
* Move geo.topic to "advanced"...perhaps we rename "advanced" to ner?
* Move ctakes to "advanced/ner"?
* Collapse web and text?

Again, this is fantastic.  Thank you!




was (Author: [email protected]):
[~bobpaulin], this is an awesome step forward.  Must have been a fair amount of 
work. Thank you!

Few questions...not just for you, but for all.  I'm happy to submit/commit 
patches, but I want to make sure I don't do anything objectionable to the 
community

* This is probably user error, but I'm getting: \[ERROR\] Failed to execute 
goal org.apache.maven.plugins:maven-dependency-plugin:2.10:unpack (unpack) on 
project tika-text-module: Unable to find artifact. Could not find artifact 
org.apache.tika:tika-test-resources:jar:tests:2.0-SNAPSHOT in apache.snapshots 
(http://repository.apache.org/snapshots)
* Perhaps rename artifact names in parser sub-components to include 
"Parser(s?)", e.g. Apache Tika Parser Advanced Module so that the names sort 
more clearly (at least in the maven window in Intellij)?
* Perhaps add "parser(s?) to the artifactId, e.g. tika-parser-cad-module
* Perhaps lowercase names in parser-subcomponents so that they're inline with 
legacy: "Apache Tika parser advanced module"
* Pkcs7Parser ... should that be under advanced...or somewhere else ...own 
crypto package?
* iwork ...should we move that to office?
* tika-test-resources...should we move TikaTest into that and change the name 
to tika-test?  I have a vague memory of wanting to carve out a separate test 
package earlier and adding TikaTest and something else...
* OutlookPSTParser...move that to office?  
* Does MBox belong in web?  Not sure where to put it?
* Move CommonsDigester to core _if_ we're willing to add a dependency on 
commons-digest into core?
* Move Activator to tika-bundle?
* Move pot to multimedia or add tika-parsers-multimedia-advanced-module?
* Move geo.topic to "advanced"...perhaps we rename "advanced" to ner?
* Move ctakes to "advanced/ner"?
* Collapse web and text?

Again, this is fantastic.  Thank you!



> Tika 2.0 -  Create Initial Parser Modules
> -----------------------------------------
>
>                 Key: TIKA-1824
>                 URL: https://issues.apache.org/jira/browse/TIKA-1824
>             Project: Tika
>          Issue Type: Improvement
>    Affects Versions: 2.0
>            Reporter: Bob Paulin
>            Assignee: Bob Paulin
>
> Create initial break down of parser modules.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to