[
https://issues.apache.org/jira/browse/UIMA-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15411998#comment-15411998
]
Richard Eckart de Castilho commented on UIMA-4685:
--------------------------------------------------
I don't think the performance problem per se has been addressed. It can be kind
of being worked around by loading bcas based on Java serialization. But the
underlying problem afaik still exists.
[~schor], I think the latest state on the performance problem is still recorded
in the mailing list archive: http://markmail.org/thread/ezr4j6baoxvh2wqu
I think we didn't get past this (quoting [~schor]):
{quote}
It does look like there's a performance issue if not allowing duplicate adds,
for
exactly the use case you mentioned: lots of FSs which compare "equal" according
to the sorted index keys, but which are not the same FS.
This can be fixed I think.
{quote}
> Add Common Load/Save APIs, including type system and index defs, for various
> CAS formats
> ----------------------------------------------------------------------------------------
>
> Key: UIMA-4685
> URL: https://issues.apache.org/jira/browse/UIMA-4685
> Project: UIMA
> Issue Type: New Feature
> Components: CasEditor, Core Java Framework
> Affects Versions: 2.8.1SDK
> Reporter: Peter Klügl
> Assignee: Peter Klügl
> Fix For: 2.9.0SDK
>
> Attachments: BinaryCasIOUtils.java, caseditor.csv,
> caseditor_bincas.csv
>
>
> Create a new CasIOUtils class having common load and save APIs, with optional
> saving of type system and index definitions, in all the various CAS
> serialization formats (except JSON). Implement heuristic discovery for
> deserialization to support automatic selection of the right kind of
> deserializer. Update the headers for binary to support indicating they have
> the type system included (not used yet). Update the SerialFormat to support
> this. Update the CasEditor and other components to use this new API.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)