We are using Libre Office in service/server mode and pushing documents with an api that escapes me atm... just woke up. But point, the conversion with Libreoffice is a
I am working on the raw xml and solved this yesterday, but my question was largely in relation to how to leverage POI to access the data via OPCPackage, I was having a difficult time understanding how to access the data (and ultimately relied on IOUtils because of some funkiness. On Apr 28, 2015 4:05 AM, "Mark Beardsley" <[email protected]> wrote: > Perhaps a little risky, but what is preventing you from working directly on > the raw XML? The Office file is simply a zipped archive containing various > folders and files. If you know where the 'offending' markup is, what is to > prevent you from unzipping the archive, manipulating the XML either > directly > or using a read/write parser and then zipping the archive back up again. > > I am also compelled to ask, what happens if you use Office to convert one > of > the offending files? How does the markup this produces differ from the > LibreOffice produces? > > > > -- > View this message in context: > http://apache-poi.1045710.n5.nabble.com/Remove-Invalid-XML-in-DOCX-tp5718602p5718619.html > Sent from the POI - User mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
