I have tested, and it works just fine (at least with the default text extraction strategy). Maybe the problem is that you are using the Simple strategy? That isn't really used very much (in fact, it's really in the code more as a proof of concept more than anything that should ever actually be used).
PdfContentReaderTool "path/to/file.pdf" stdout 1 results in: ==============Page 1==================== - - - - - Dictionary - - - - - - (/Parent=Dictionary of type: /Pages, /Type=/Page, /Contents=Stream, /Resources=Dictionary, /MediaBox=[0, 0, 595, 842]) Subdictionary /Parent = (/ITXT=5.0.2, /Type=/Pages, /Count=2, /Kids=[18 0 R, 11 0 R]) Subdictionary /Resources = (/ProcSet=[/PDF, /Text, /ImageB, /ImageC, /ImageI], /XObject=Dictionary, /Font=Dictionary) Subdictionary /XObject = (/img0=Stream of type: /XObject) Subdictionary /Font = (/F1=Dictionary of type: /Font, /F2=Dictionary of type: /Font) Subdictionary /F1 = (/BaseFont=/Helvetica, /Type=/Font, /Subtype=/Type1, /Encoding=/WinAnsiEncoding) Subdictionary /F2 = (/BaseFont=/Helvetica-Bold, /Type=/Font, /Subtype=/Type1, /Encoding=/WinAnsiEncoding) - - - - - XObject Summary - - - - - - ------ /img0 - subtype = /Image = 5815 bytes ------ - - - - - Content Stream - - - - - - << removed >> - - - - - Text Extraction - - - - - - 19252 EMPID:16477 Form No. 16 {See Rule 31(1)(a) } PART A Certificate under section 203 of the Income-tax Act, 1961 for Tax deducted at source from Salary Name and address of the employer Name and designation of the employee ORACLE INDIA PRIVATE LIMITED Vikash Mittal India Development Centre Project Leader Oracle Technology Park, 3, Bannerghatta Road, Bangalore KARNATAKA INDIA PAN of the Deductor TAN of the Deductor PAN of the Employee AAACO0158L BLRO00194F AIOPM2060N CIT(TDS) Period Assessment Year CIT(TDS) 18(1) FROM TO No.59, 4th Floor , HMT Bhavan, Ganganagar 2012-2013 01-Apr-2011 31-Mar-2012 Bangalore 560032 Summary of tax deducted at source Quarter Receipt Numbers of original statement of TDS under sub-section Amount of tax deducted Amount of tax (3) of section 200 in respect of the deposited/remitted in employee respect of the employee Quarter 1 CUUXXLAB 4,314.00 4,314.00 Quarter 2 CUTXBZCB 14,027.00 14,027.00 Quarter 3 CUVXFYFC 11,627.00 11,627.00 Quarter 4 CUSXUFZB 9,127.00 9,127.00 Total 39,095.00 39,095.00 PART B(Refer Note 1) Details of Salary Paid and any other income and tax deducted 1.Gross Salary Rs. Rs. Rs. a)Salary as per provisions contained in sec.17(1) 993,163.13 b)Value of perquisites u/s 17(2) (as per Form No.12BA, wherever applicable) 0.00 c)Profits in lieu of salary under section 17(3) (as per Form No.12BA, wherever applicable) 0.00 d)Total 993,163.13 2.Less :Allowance to the extent exempt u/s 10 Allowance Rs. Conveyance 9,600.00 L T A Non Taxable 23,320.00 House Rent Allowance 156,922.00 189,842.00 3.Balance(1-2) 803,321.13 4.Deductions : a)Entertainment allowance 0.00 b)Tax on employment 2,400.00 5.Aggregate of 4(a) and 4(b) 2,400.00 6.Income chargeable under the head 'Salaries'(3-5) 800,921.13 7.Add: Any other income reported by the employee Income Rs. Loss From House Property -147,067.60 -147,067.60 8.Gross Total Income(6+7) 653,853.53 -- View this message in context: http://itext-general.2136553.n4.nabble.com/Exception-during-parsing-pdf-tp4659887p4659894.html Sent from the iText - General mailing list archive at Nabble.com. ------------------------------------------------------------------------------ _______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php