Have you tried using ghostscript to convert pdf to tif files instead? Example commands
gs -r600x600 -sDEVICE=tiffg4 -dFirstPage=106 -dLastPage=109 -o ./tulasi/tulasikrishna%00d.tif "TulasiPuja.pdf" for one tif per page gs -r600x600 -sDEVICE=tiffg4 -dFirstPage=126 -dLastPage=131 -o ./tulasi/tulasIviShNupUjA.tif "TulasiPuja.pdf" for multipage tif you can reduce resolution to -r300x300 ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Thu, Jun 8, 2017 at 7:25 PM, Hari.K <[email protected]> wrote: > Hi There, > > I sometimes receive an error - "Failed to create pix, this normally > occurs because the requested image size is too large, please check Standard > Error Output" when doing OCR on a bitmap image. > > > Below highlighted line is where it's breaking for me - > > Bitmap bitmap; > Spire.Pdf.PdfDocument document = new Spire.Pdf.PdfDocument(pdfPath); > > > for (int i = 0; i <= document.Pages.Count; i++) > { > bitmap = (Bitmap)document.SaveAsImage(i, > PdfImageType.Bitmap, 200, 200); // where 200 is the DPI which I am > setting for a bitmap image > ................... > ................. > > } > > More details on what I am trying to do here: > 1) Uploaded a PDF document which is of hardly 600KB > 2) Iterate through each PDF page and convert it into a BitMap image > 3) Then input this BitMap image to Tesseract for performing OCR > > Please note, I don't get this error often. Any ideas on why this error as > I do not receive this every time ? > > Looking forward for some inputs on this.. > > Thanks in Advance, > Hari > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/dcfe7918-707b-4b56-9720-b3e39ae1a658% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/dcfe7918-707b-4b56-9720-b3e39ae1a658%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXaN-9w4LG_0SFrEGy7GnxQeJiDbn5E2J-Po6yBwRfdFA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

