We are using Libre Office in service/server mode and pushing documents with
an api that escapes me atm... just woke up.  But point, the conversion with
Libreoffice is a

I am working on the raw xml and solved this yesterday, but my question was
largely in relation to how to leverage POI to access the data via
OPCPackage, I was having a difficult time understanding how to access the
data (and ultimately relied on IOUtils because of some funkiness.
On Apr 28, 2015 4:05 AM, "Mark Beardsley" <[email protected]> wrote:

> Perhaps a little risky, but what is preventing you from working directly on
> the raw XML? The Office file is simply a zipped archive containing various
> folders and files. If you know where the 'offending' markup is, what is to
> prevent you from unzipping the archive, manipulating the XML either
> directly
> or using a read/write parser and then zipping the archive back up again.
>
> I am also compelled to ask, what happens if you use Office to convert one
> of
> the offending files? How does the markup this produces differ from the
> LibreOffice produces?
>
>
>
> --
> View this message in context:
> http://apache-poi.1045710.n5.nabble.com/Remove-Invalid-XML-in-DOCX-tp5718602p5718619.html
> Sent from the POI - User mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to