[
https://issues.apache.org/jira/browse/UIMA-5135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marshall Schor updated UIMA-5135:
---------------------------------
Description:
A recent Jira UIMA-5120 was logging CASs to file system directories, and
including a type system.
It would be good to have a conventional,supported way to do this common kind of
operation, added to CasIOUtils.
Additionally, it would be good to support as an alternative the standard XML
serialization format for type systems.
Some possible conventions:
* multiple cas files, in 1 directory, with one additional file with the name
"typesystem.xml".
* the above style, in one zip file (for example, to be able to read it, one cas
at a time, via some iterator).
* finding a type system via the class path following uimaFIT conventions
One factor that probably is important is to store the type system for this kind
of thing "close to" the serialized forms it applies to.
It would be possible of course to support multiple conventions. However, the
more conventions, the less benefit from "standardization", so this ought to be
a balance.
was:
A recent Jira UIMA-5120 was logging CASs to file system directories, and
including a type system.
It would be good to have a conventional,supported way to do this common kind of
operation, added to CasIOUtils.
Additionally, it would be good to support as an alternative the standard XML
serialization format for type systems.
Some possible conventions:
* multiple cas files, in 1 directory, with one additional file with the name
"typesystem.xml".
* the above style, in one zip file (for example, to be able to read it, one cas
at a time, via some iterator).
* finding a type system via the class path following uimaFIT conventions
It would be possible of course to support multiple conventions. However, the
more conventions, the less benefit from "standardization", so this ought to be
a balance.
> UIMA CasIOUtils enhancements in handling type systems
> -----------------------------------------------------
>
> Key: UIMA-5135
> URL: https://issues.apache.org/jira/browse/UIMA-5135
> Project: UIMA
> Issue Type: Improvement
> Components: Core Java Framework
> Reporter: Marshall Schor
> Priority: Minor
> Fix For: 3.0.0SDKexp, 2.9.1SDK
>
>
> A recent Jira UIMA-5120 was logging CASs to file system directories, and
> including a type system.
> It would be good to have a conventional,supported way to do this common kind
> of operation, added to CasIOUtils.
> Additionally, it would be good to support as an alternative the standard XML
> serialization format for type systems.
> Some possible conventions:
> * multiple cas files, in 1 directory, with one additional file with the name
> "typesystem.xml".
> * the above style, in one zip file (for example, to be able to read it, one
> cas at a time, via some iterator).
> * finding a type system via the class path following uimaFIT conventions
> One factor that probably is important is to store the type system for this
> kind of thing "close to" the serialized forms it applies to.
> It would be possible of course to support multiple conventions. However, the
> more conventions, the less benefit from "standardization", so this ought to
> be a balance.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)