Thanks for the information. I am getting them to send me the document in XML and EPUB formats and will look into it.
I will keep you posted.

Greg

--- On Thu, 5/8/10, George, Andre (Dr) <[email protected]> wrote:

> From: George, Andre (Dr) <[email protected]>
> Subject: RE: [phpug]Moving Indesign file information into PHP application
> To: [email protected]
> Date: Thursday, 5 August, 2010, 12:35
>
> Hi Greg
>
> Let me know if you can solve this. Our org is looking at doing something similar, so we would REALLY like to use any solutions you can come up with.
>
> From my preliminary analysis, I found the problem not insignificant and highly dependent on how structured the InDesign typography was, i.e. tags à la CSS.
>
> I was looking at a combination of parsing both the HTML and XML exports (InDesign CS2 and CS4):
>
> The HTML gives the images nicely, whereas in the XML the images are encoded (MIME/base64 style).
> The XML has a better record of the actual document structure and layout.
>
> In summary, if the original InDesign doc is not structured, i.e. it uses custom layout rather than formal DTP-style typography tagging, accurate parsing is rather difficult to achieve. A good test is whether InDesign can export the doc to EPUB with relative accuracy.
>
> PS Adobe evade the issue on this one: their idea of InDesign to Web is via a Flash doc.
>
> andre
>
> -----Original Message-----
> From: [email protected] [mailto:[email protected]] On Behalf Of gregor brabyn
> Sent: Thursday, 5 August 2010 11:36
> To: [email protected]
> Subject: [phpug]Moving Indesign file information into PHP application
>
> Hi
>
> I am quoting on making a document management system that has highly hierarchical structured data. The document is mostly text but does contain images and data in tables.
>
> Currently the document is maintained in InDesign CS2 and I understand that InDesign can export the data as XML or as a PDF.
>
> The client has asked for a quote, among other things, on transferring the document from InDesign to the new PHP system I would build.
>
> Has anyone had any experience with anything like this? With an XML version I am not sure how I would deal with loading the images. With a PDF version I am not sure if there are any PHP-based PDF parsers that would keep the hierarchical structure intact. I am not even sure if the PDF internally keeps this data structure.
>
> Does anyone have any advice on this? It is currently a large document that in PDF format has something like one and a half thousand pages. Is it going to be better to just use a data entry operator?
>
> Greg
>
> --
> NZ PHP Users Group: http://groups.google.com/group/nzphpug
> To post, send email to [email protected]
> To unsubscribe, send email to [email protected]
>
> please consider the environment before you print this email
>
> ##########################################################################################
>
> The views expressed in this e-mail and any corresponding attachments do not necessarily reflect those of the Health Research Council of New Zealand. This e-mail together with any accompanying attachments may be confidential and subject to legal privilege. If you have received this message in error, please notify the sender immediately and note that you may not copy, disclose or use the content in any way.
>
> Thank You.
> ##########################################################################################
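
A minimal sketch of the XML route discussed in this thread, written in Python purely for illustration (in PHP, SimpleXML and base64_decode() would play the same roles). It assumes, as Andre reports, that images arrive base64-encoded inside the XML. The tag names section, title, para and image, the name attribute, and the sample data are all hypothetical: InDesign XML tags are defined per document, so a real export will look different.

```python
import base64
import xml.etree.ElementTree as ET

# Hypothetical sample, NOT a real InDesign export: tag names are assumptions.
SAMPLE = """<document>
  <section>
    <title>Chapter 1</title>
    <para>Intro text.</para>
    <section>
      <title>1.1 Scope</title>
      <image name="logo.png">aGVsbG8=</image>
    </section>
  </section>
</document>"""


def extract(root):
    """Walk the tree, keeping the section hierarchy and decoding images."""
    outline = []  # (depth, title) pairs in document order
    images = {}   # image name -> decoded bytes

    def visit(node, depth):
        for child in node:
            if child.tag == "section":
                title = child.findtext("title", default="").strip()
                outline.append((depth, title))
                visit(child, depth + 1)  # nested sections sit one level deeper
            elif child.tag == "image":
                payload = (child.text or "").strip()
                images[child.get("name")] = base64.b64decode(payload)

    visit(root, 0)
    return outline, images


outline, images = extract(ET.fromstring(SAMPLE))
for depth, title in outline:
    print("  " * depth + title)
```

The depth stored with each title is the piece that carries the hierarchy; an import script could use it to rebuild parent/child relationships, and write each decoded image out to a file.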
