You can try pdftohtml[1] to get it into HTML format. From there it should be easier to convert it into the document format you want using something like pandoc[2].
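As a rough sketch of that two-step pipeline (not something I have tested on your documents), here is a small Python wrapper. It assumes both `pdftohtml` and `pandoc` are installed and on your PATH. The function names `build_commands` and `convert` are made up for illustration. Note that pandoc can write .docx and .odt, but not the legacy .doc format, and how much of the layout survives will depend on how each PDF was produced.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_path, out_format="docx"):
    """Return the two argv lists for the PDF -> HTML -> document pipeline.

    out_format is assumed to be an extension pandoc can write, e.g.
    "docx" or "odt"; pandoc infers the output format from it.
    """
    pdf = Path(pdf_path)
    html = pdf.with_suffix(".html")
    target = pdf.with_suffix("." + out_format)
    # -noframes makes pdftohtml write one plain HTML file instead of a frameset.
    to_html = ["pdftohtml", "-noframes", str(pdf), str(html)]
    # pandoc detects HTML input and the output format from the file extensions.
    to_doc = ["pandoc", str(html), "-o", str(target)]
    return to_html, to_doc


def convert(pdf_path, out_format="docx"):
    """Run both steps in order, stopping if either tool reports an error."""
    for cmd in build_commands(pdf_path, out_format):
        subprocess.run(cmd, check=True)
```

Calling `convert("spec.pdf", "odt")` would run `pdftohtml -noframes spec.pdf spec.html` followed by `pandoc spec.html -o spec.odt`. Building the command lists separately from running them makes it easy to print and check the commands before running them over a whole batch of files.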
[1]: http://pdftohtml.sourceforge.net/
[2]: http://johnmacfarlane.net/pandoc/README.html

On 18 July 2014 14:47, Terry Coles <d-...@hadrian-way.co.uk> wrote:
> Hi,
>
> Does anyone know how I can use tools available in Linux to convert a PDF file
> to MS Word .doc or .docx format (or even to LibreOffice .odt)?
>
> I thought I could do it using LibreOffice, but it reads the PDF content as if it
> is a series of graphical objects with text labels. As a consequence, I can
> only save it as .odg or export it to a graphical format.
>
> The problem is that we have a number of specifications in PDF format. We need
> to get them into an editable form (preferably word) because they need
> translating.
>
> At work I tried the real thing (Adobe Writer), but it seriously mangles the
> format, even when it works.
>
> The originals seem to have been created using a number of different tools; some
> were created in MS Word 2010, some PDFCreator (presumably from a Word Source,
> some with Acrobat Distiller and some by conversion from Postscript. Adobe
> Writer was only able to save three out of five documents and they were not very
> good.
>
> --
> Terry Coles
>
> --
> Next meeting: Bournemouth, Tuesday, 2014-08-05 20:00
> Meets, Mailing list, IRC, LinkedIn, ... http://dorset.lug.org.uk/
> New thread on mailing list: mailto:dorset@mailman.lug.org.uk
> How to Report Bugs Effectively: http://goo.gl/4Xue

--
Andrew Montgomery-Hurrell
Professional Geek
Blog: http://darkliquid.co.uk
Twitter: http://twitter.com/darkliquid
Fiction: http://www.protagonize.com/author/darkliquid