On Tue, Apr 12, 2022 at 5:24 PM Francesco Pretto <cez...@gmail.com> wrote:

> On Tue, 12 Apr 2022 at 14:50, zyx <z...@gmx.us> wrote:
> > there exists a text extract tool [1], which is supposed to, well, extract
> > text from the PDF files.
> > [1]
> https://sourceforge.net/p/podofo/code/HEAD/tree/podofo/branches/PODOFO_0_9_7_BRANCH/tools/podofotxtextract/
> >
>
> Correct: albeit many text related operators are not handled, that is
> the code to look in PoDoFo.
>
>
Just note that text position really does not depend on "m" or "l" operators
like that code may misleadingly suggest (correct me if I am wrong):

            if( strcmp( pszToken, "l" ) == 0 ||
                strcmp( pszToken, "m" ) == 0 )
            {
                if( stack.size() == 2 )
                {
                    dCurPosX = stack.top().GetReal();
                    stack.pop();
                    dCurPosY = stack.top().GetReal();
                    stack.pop();


> Cheers,
> Francesco
>
>
> _______________________________________________
> Podofo-users mailing list
> Podofo-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/podofo-users
>
_______________________________________________
Podofo-users mailing list
Podofo-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/podofo-users

Reply via email to