[
https://issues.apache.org/jira/browse/TIKA-1880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16591986#comment-16591986
]
Michael Standfuss commented on TIKA-1880:
-----------------------------------------
Currently experiencing this same issue. [[email protected]] do you know if
there was ever any code written to fix the issue or did you implement any kind
of workaround for it?
> Attribute number-columns-repeated not correctly used in ODS documents
> ---------------------------------------------------------------------
>
> Key: TIKA-1880
> URL: https://issues.apache.org/jira/browse/TIKA-1880
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.12
> Reporter: Ryan Desmond
> Priority: Minor
> Labels: LibreOffice
> Attachments: example.ods
>
>
> When the ODS writer was first written, it made the assumption that the
> `number-columns-repeated` attribute for cells would only be used for blank
> cells. This is not the case with documents created by (at least) LibreOffice
> 4.4.7.2. The current approach to repeated cells is to use the HTML
> concept of spanning, which is not suitable for repeated content.
> The note in the Tika source (OpenDocumentContentParser.java#L459):
> TODO: The following is not correct, the cell should be repeated not spanned!
> Code generates an HTML cell, spanning all repeated columns, to make the cell
> look correct. Problems may occur when both spanning and repeating are given,
> which is not allowed by spec. Cell spanning instead of repeating is not a
> problem, because OpenOffice uses it only for empty cells.
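
For anyone looking at a workaround, here is a minimal sketch of the
"repeat, don't span" behaviour the description asks for. It is not Tika code:
the `RepeatedCellSketch` class, the `Cell` type and `renderRow` are
hypothetical names made up for illustration, and the sketch assumes the
`number-columns-repeated` value has already been read into an int (1 when the
attribute is absent). It also does no HTML escaping.

```java
import java.util.ArrayList;
import java.util.List;

public class RepeatedCellSketch {

    /** Hypothetical stand-in for one parsed table cell (not a Tika class). */
    static final class Cell {
        final String text;
        final int repeat; // assumed value of number-columns-repeated, 1 if absent

        Cell(String text, int repeat) {
            this.text = text;
            this.repeat = repeat;
        }
    }

    /** Emits one <td> per repetition instead of a single cell with a colspan. */
    static String renderRow(List<Cell> cells) {
        StringBuilder html = new StringBuilder("<tr>");
        for (Cell cell : cells) {
            int n = Math.max(1, cell.repeat); // treat missing or invalid counts as 1
            for (int i = 0; i < n; i++) {
                html.append("<td>").append(cell.text).append("</td>");
            }
        }
        return html.append("</tr>").toString();
    }

    public static void main(String[] args) {
        List<Cell> row = new ArrayList<>();
        row.add(new Cell("a", 1));
        row.add(new Cell("x", 3)); // content cell repeated three times
        // prints <tr><td>a</td><td>x</td><td>x</td><td>x</td></tr>
        System.out.println(renderRow(row));
    }
}
```

A real change would probably also need to decide what to do with long runs of
repeated empty cells, which this sketch simply expands like any other cell.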
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)