Hi

On 11/30/2011 11:33 AM, filtered wrote:
We are trying to use LibreOffice 3.4.4 (MacOSX, Linux) in order to convert
a 40 MB DOCX file to HTML.
The size comes from several high-res images (reducing the resolution of the
images in advance is not an option).


ajung@blackmoon:~/tmp/x> libreoffice --convert-to html --headless Prostatakrebs_P_1.0__konvert_ready_.docx
convert /home/ajung/tmp/x/Prostatakrebs_P_1.0__konvert_ready_.docx -> /home/ajung/tmp/x/Prostatakrebs_P_1.0__konvert_ready_.html using XHTML Writer File
Entity: line 5: error: xmlSAX2Characters: huge text node: out of memory
HniXw9zvjtHlRJAShhR3Jt+VtHjCpw71Ic1T+bQK5cvbAsKF50lZU7qDKevBzaj8odxElCKjKSAkuEgE

    ^
Entity: line 5: parser error : Extra content at the end of the document
HniXw9zvjtHlRJAShhR3Jt+VtHjCpw71Ic1T+bQK5cvbAsKF50lZU7qDKevBzaj8odxElCKjKSAkuEgE

    ^
Error: Please reverify input parameters...
pure virtual method called
terminate called without an active exception

The error is reproducible on Mac and Linux, both with 4 GB of RAM.

Is there a solution for this situation?

Kind regards,
Andreas Jung

Another possible workaround is to extract the docx file using a zip utility and then work directly with the XML and image files.

On a side note, the HTML will have a link to the image because images and other content are stored in the text part of the file.
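
The extraction workaround could be sketched roughly as follows in Python, using only the standard zipfile module. This is a minimal sketch, not a tested fix for the conversion error: the function name extract_docx and the output directory are made up for illustration, and it assumes the usual docx layout, where the main text is in word/document.xml and embedded images are under word/media/.

```python
import zipfile
from pathlib import Path


def extract_docx(docx_path, out_dir):
    """Unpack a .docx (a ZIP archive) and return the paths of its media files.

    extract_docx is a hypothetical helper name, not part of LibreOffice.
    """
    out = Path(out_dir)
    with zipfile.ZipFile(docx_path) as archive:
        # Writes every archive member below out_dir, keeping the folder layout.
        archive.extractall(out)
        # Assumption: embedded images live under word/media/ in the archive.
        media = [
            name
            for name in archive.namelist()
            if name.startswith("word/media/") and not name.endswith("/")
        ]
    return [out / name for name in media]
```

Calling extract_docx("Prostatakrebs_P_1.0__konvert_ready_.docx", "extracted") should leave the document text in extracted/word/document.xml and return the image paths, so the large images can be handled separately from the XML.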

--
Jay Lozier
[email protected]


