On Tue, Apr 12, 2022 at 5:24 PM Francesco Pretto <cez...@gmail.com> wrote:
> On Tue, 12 Apr 2022 at 14:50, zyx <z...@gmx.us> wrote: > > there exists a text extract tool [1], which is supposed to, well, extract > > text from the PDF files. > > [1] > https://sourceforge.net/p/podofo/code/HEAD/tree/podofo/branches/PODOFO_0_9_7_BRANCH/tools/podofotxtextract/ > > > > Correct: albeit many text related operators are not handled, that is > the code to look in PoDoFo. > > Just note that text position really does not depend on "m" or "l" operators like that code may misleadingly suggest (correct me if I am wrong): if( strcmp( pszToken, "l" ) == 0 || strcmp( pszToken, "m" ) == 0 ) { if( stack.size() == 2 ) { dCurPosX = stack.top().GetReal(); stack.pop(); dCurPosY = stack.top().GetReal(); stack.pop(); > Cheers, > Francesco > > > _______________________________________________ > Podofo-users mailing list > Podofo-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/podofo-users >
_______________________________________________ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users