Hi
On 11/30/2011 11:33 AM, filtered wrote:
We are trying to use LibreOffice 3.4.4 (MacOSX, Linux) in order to convert
a 40 MB DOCX file to HTML.
The size comes from several high-res images (reducing the resolution of the
images in advance is not an option)
ajung@blackmoon:~/tmp/x> libreoffice --convert-to html --headless
Prostatakrebs_P_1.0__konvert_ready_.docx
convert /home/ajung/tmp/x/Prostatakrebs_P_1.0__konvert_ready_.docx ->
/home/ajung/tmp/x/Prostatakrebs_P_1.0__konvert_ready_.html using XHTML
Writer File
Entity: line 5: error: xmlSAX2Characters: huge text node: out of memory
HniXw9zvjtHlRJAShhR3Jt+VtHjCpw71Ic1T+bQK5cvbAsKF50lZU7qDKevBzaj8odxElCKjKSAkuEgE
^
Entity: line 5: parser error : Extra content at the end of the document
HniXw9zvjtHlRJAShhR3Jt+VtHjCpw71Ic1T+bQK5cvbAsKF50lZU7qDKevBzaj8odxElCKjKSAkuEgE
^
Error: Please reverify input parameters...
pure virtual method called
terminate called without an active exception
The error is reproducable on Mac and Linux - both having 4 GB of RAM.
Is there some solution fixing this situation?
Mit freundlichen Grüßen,
Andreas Jung
Another possible work around is to extract the docx file using a zip
utility and then working directly with the xml and image files.
One a side note, the html will have a link to the image because images
and other content are stored in the text part of the file
--
Jay Lozier
[email protected]
--
For unsubscribe instructions e-mail to: [email protected]
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted