I dimly recall doing this a few years ago. I think we ended up converting the document to X(HT)ML and then manually sorting out the images.
I guess the best approach depends on the ratio between text/images in your source document. R On 5 August 2010 11:35, gregor brabyn <[email protected]> wrote: > Hi > > I am quoting on making a document management system that has highly > hierarchial structured data. The document is mostly text but does contain > images and data in tables. > > Currently the document is maintained in Indesign CS2 and I understand that > Indesign can export the data as XML or as a PDF. > > The client has asked for a quote, among things, on transferring the > document from Indesign to the new PHP system I would build. > > Has anyone had any experience with anything like this. With an XML version > I am not sure how I would deal with loading the images. With a PDF version I > am not sure if there are any PHP based PDF parsers that would keep the > hierarchial structure intact. Not even sure if the PDF internally keeps this > data structure. > > Does anyone have any advice on this. It is currently a large document that > in PDF format has something like one and a half thousand pages. Is it going > to be better to just use a data entry operator? > > Greg > > > > > -- > NZ PHP Users Group: http://groups.google.com/group/nzphpug > To post, send email to [email protected] > To unsubscribe, send email to > [email protected]<nzphpug%[email protected]> -- NZ PHP Users Group: http://groups.google.com/group/nzphpug To post, send email to [email protected] To unsubscribe, send email to [email protected]
