Hi, yes, I tried pdftotext before, but Okular's results are much better for my needs (parsing PDF documents). I need to parse "CSV" or "fixed-length" style documents that are unfortunately in PDF format. If anyone has a suggestion on how to parse them without converting them to text, please let me know.
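For reference, here is a minimal sketch of the kind of parsing I have in mind, in case it helps the discussion. It assumes poppler's pdftotext is installed and is run with its -layout option (which tries to keep the physical layout of the page); I cannot say whether that gets as close to Okular's spacing as I need. The column offsets, the sample text and the function names are made up for illustration and would have to be adapted to the real documents.

```python
import subprocess


def pdf_to_layout_text(pdf_path):
    """Run pdftotext with -layout and return the extracted text.

    Passing "-" as the output file makes pdftotext write to stdout.
    Assumes the pdftotext binary (poppler-utils) is on the PATH.
    """
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


def split_fixed_width(line, bounds):
    """Cut one line into fields at the given (start, end) character offsets."""
    return [line[start:end].strip() for start, end in bounds]


def parse_fixed_width(text, bounds):
    """Parse every non-blank line of text into a list of stripped fields."""
    return [
        split_fixed_width(line, bounds)
        for line in text.splitlines()
        if line.strip()
    ]


# Hypothetical sample standing in for pdftotext output; real documents
# will have different columns and offsets.
SAMPLE = (
    "ID    NAME        AMOUNT\n"
    "001   Widget       12.50\n"
    "\n"
    "002   Gadget        7.00\n"
)

# Hypothetical column boundaries: ID, NAME, AMOUNT.
BOUNDS = [(0, 6), (6, 18), (18, 26)]

if __name__ == "__main__":
    for row in parse_fixed_width(SAMPLE, BOUNDS):
        print(row)
```

With a real file, the text would come from pdf_to_layout_text("some.pdf") instead of SAMPLE. This only works if the extracted text keeps the columns aligned, which is exactly why the quality of the spacing matters to me.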
> From: aa...@kde.org
> To: okular-devel@kde.org
> Date: Thu, 10 Nov 2011 13:45:34 +0100
> Subject: Re: [Okular-devel] Export from pdf to txt, invoking from the command line
>
> On Thursday, 10 November 2011, filippo di natale wrote:
> > Hi,
> > I like very much how Okular exports pdf to txt keeping the correct spacing
> > (doing the same with acrobat on windows gave no such clean results). Given
> > that I cannot invoke okular from the command line to make a pdf to txt
> > conversion (or so I seem to understand) which library okular uses to do its
> > pdf to txt conversion? Or, if it is developed internally in the project,
> > can it be used stand alone to make a command line pdf to txt converter, and
> > which part of the source code should I look ? Thanks,
>
> No, okular does not have a export to text command line. It should not be
> extremely difficult, but we do not have it yet.
>
> You can try to use pdftotext command line, it is not what okular uses but it
> is known to be good enough in some cases.
>
> Albert
>
> > Filippo
> _______________________________________________
> Okular-devel mailing list
> Okular-devel@kde.org
> https://mail.kde.org/mailman/listinfo/okular-devel