Am 02.02.2017 um 21:12 schrieb Pulkit Kapur:
I am getting just the headers:
"2016 IEEE/RSJ International Conference on Intelligent Robots and Systems
(IROS)
Daejeon Convention Center
October 9-14, 2016, Daejeon, Korea
978-1-5090-3761-2/16/$31.00 ©2016 IEEE 5324
5325
5326
5327
5328
5329
5330
5331
"
Did use the new file path:
javaaddpath('C:\Users\XXX\Downloads\New
folder\pdfParseDemo\PDFBox-app-2.0.4.jar')
javaaddpath('C:\Users\XXX\Downloads\New
folder\pdfParseDemo\FontBox-2.0.4.jar')

I don't know how matlab works. What I mean is to delete the old PDFBox and fontbox versions form all directories. Then recompile (if applicable) and redeploy your stuff.

If it still doesn't work, use a different directory or a different computer. I tested your file and PDFBox extracts quite a lot.

Tilman




On Thu, Feb 2, 2017 at 3:11 PM, Tilman Hausherr <[email protected]>
wrote:

Am 02.02.2017 um 20:26 schrieb Pulkit Kapur:

Thanks. Thats what i would expect to read.
Also thanks for pointing to the latest version. I pointed to the
pdfbox-app-2.0.4.jar and the fontbox-2.0.4.jar files.

Since i want to read over 1000 pdf documents programmatically in matlab, i
am not using the command line, but using the java library in matlab.
Not sure why i am still *not *getting the text using getText()
{code}
pdfdoc = org.pdfbox.pdmodel.PDDocument;
pdfdoc.close;
reader = org.pdfbox.util.PDFTextStripper;

% list all the pdf files in the current folder
% listing = dir('**/*.pdf');
listing = dir('*.pdf');

      pdfdoc = pdfdoc.load(fullfile(listing(i).folder,listing(i).name));
      pdfdoc.isEncrypted;

      %% text, with planty of padding
      pdfstr = reader.getText(pdfdoc);                 %#ok
      pdfdoc.close
{\code}


Are you getting nothing at all? Or just not all?

Make sure you cleaned your class path.


Tilman

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]




---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to