On Tue, 8 Sep 2009, Michal Hocko wrote: > On Tue, Sep 08, 2009 at 07:20:30PM +0200, Michal Hocko wrote: >> On Tue, Sep 08, 2009 at 05:10:05PM +0200, Thomas Spahni wrote: >>> On Mon, 7 Sep 2009, Jozef Misutka wrote: >>> >>>> >>>> done, >>>> >>>> check pdf_to_text.cpp in newest tools package in sourceforge downloads. >>>> (tools-Win32-20090907_1746.zip) or >>>> http://pdfedit.cvs.sourceforge.net/viewvc/pdfedit/pdfedit/src/tests/tools/pdf_to_text.cc?revision=1.1&view=markup >>>> >>>> /jozo >>> >>> Hi Jozef >>> >>> Thanks a lot!!! My progress is as follows: >>> - checked out the cvs repository >>> - did autoconf and compiled it (success) >>> - went to the directory src/tests/tools/ >>> - noticed that your pdf_to_text.cc is not (yet) in the Makefile >> >> Tools are not incorporated into our build system yet. This is just >> discussed in our devel mailing list but I assume that it will take some >> time until this will work. > > OK, it went better than I expected ;) Could you try the attached patch > series (please run cvs -q up -P -d before applying - there has been some > changes in the area). You will need to run autoconf to re-generate > configure script and then run ./configure --enable-tools [--disable-gui] > > [--disable-gui will (as name suggesting) prevevent from GUI compilation.] > > then you can go to the src directory and run make. Make sure that you > have done make clean before that. > > Btw. what kind of system do you use? (OS, Architecture, version of the > boost-program-options library). > > Thanks > -- > Michal Hocko
Hello Michal & Jozef Thank you very much. This works like a charm! Absolutely perfect. I took a fresh copy of the CVS, applied your patches, autoconf, configure, make --> build went ok. pdf_to_text works! (gui as well). There is one minor glitch: output of pdf_to_text is utf-8 (as documented) and when I recode that to latin1 with 'recode' it complains about non-valid input. Forcing it with 'recode -f' works. There must be some non-utf8 code in the *.txt file. It seems to stop at the following sequence in the text (hex): e2 96 a0. This could be a problem of the PDF-source, I don't know. I'm on a Linux install of SuSE-11.1 i686 32-bit with kernel 2.6.27.29-0.1-pae. Installed packages are from the standard repo (not yet updated). libboost_program_options is Ver. 1.36.0; gcc 4.3.2. Thomas ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Pdfedit-support mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/pdfedit-support
